DocumentCode
1858274
Title
A Scalable Tridiagonal Solver for GPUs
Author
Kim, Hee-Seok ; Wu, Shengzhao ; Chang, Li-Wen ; Hwu, Wen-Mei W.
Author_Institution
Electr. & Comput. Eng., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
fYear
2011
fDate
13-16 Sept. 2011
Firstpage
444
Lastpage
453
Abstract
We present the design and evaluation of a scalable tridiagonal solver targeted for GPU architectures. We observed that two distinct steps are required to solve a large tridiagonal system in parallel: 1) breaking down a problem into multiple sub problems each of which is independent of other, and 2) solving the sub problems using an efficient algorithm. We propose a hybrid method of tiled parallel cyclic reduction(tiled PCR) and thread-level parallel Thomas algorithm(p-Thomas). Algorithm transition from tiled PCR to p-Thomas is determined by input system size and hardware capability in order to achieve optimal performance. The proposed method is scalable as it can cope with various input system sizes by properly adjusting algorithm trasition point. Our method on a NVidia GTX480 shows up to 8.3x and 49x speedups over multithreaded and sequential MKL implementations on a 3.33GHz Intel i7 975 in double precision, respectively.
Keywords
computer graphic equipment; coprocessors; multi-threading; parallel algorithms; GPU architectures; Intel i7 975; NVidia GTX480; algorithm transition; algorithm trasition point; hardware capability; hybrid method; multithreaded MKL implementations; optimal performance; p-Thomas; scalable tridiagonal solver; sequential MKL implementations; thread-level parallel Thomas algorithm; tiled PCR; tiled parallel cyclic reduction; tridiagonal system; Computer architecture; Graphics processing unit; Instruction sets; Matrices; Parallel processing; Redundancy; Tiles; GPGPU; GPU Computing; Tridiagonal solver; Tridiagonal systems;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing (ICPP), 2011 International Conference on
Conference_Location
Taipei City
ISSN
0190-3918
Print_ISBN
978-1-4577-1336-1
Electronic_ISBN
0190-3918
Type
conf
DOI
10.1109/ICPP.2011.41
Filename
6047212
Link To Document