Title : 
A Study on the Implementation of Tridiagonal Systems Solvers Using a GPU
         
        
            Author : 
Alfaro, P. ; Igounet, P. ; Ezzatti, Pablo
         
        
            Author_Institution : 
Inst. de Comput., Univ. de la Republica Montevideo, Montevideo, Uruguay
         
        
        
        
        
        
            Abstract : 
In recent years the use of secondary hardware to accelerate computation of general-purpose problems has emerged as an alternative to traditional high performance computing (HPC) hardware. Specially, the use of graphics processors (GPUs) in the field of HPC has grown given their inherent parallel architecture and low cost. In a previous work, we have studied a preliminary implementation of the cyclic reduction method to tackle tridiagonal linear systems. In this article, we improve our previous implementation in order to accelerate the tridiagonal solvers on GPU using efficient memory techniques, such as pinned memory and coalesced access. The article also presents the implementation of parallel cyclic reduction method on GPU. We analyze and implement several methods for solving tridiagonal systems on GPUs. These implementations were evaluated on different hardware platforms, obtaining significant accelerations, allowing speedups of 3× on a NVIDIA C1060 GPU. The obtained results demonstrate that this new proposal can achieve significant speedup values when compared to an implementation of Thomas method on CPU and our previous GPU implementation.
         
        
            Keywords : 
graphics processing units; mathematics computing; matrix decomposition; parallel processing; CPU; HPC hardware; NVIDIA C1060 GPU; Thomas method; coalesced access; efficient memory techniques; general-purpose problems computation; graphics processors; high performance computing hardware; parallel architecture; parallel cyclic reduction method; pinned memory; secondary hardware; tridiagonal linear systems; tridiagonal systems solvers; Equations; Graphics processing units; Hardware; Instruction sets; Linear systems; Matrices; Vectors; GPU; cyclic reduction; tridiagonal systems;
         
        
        
        
            Conference_Titel : 
Computer Science Society (SCCC), 2011 30th International Conference of the Chilean
         
        
            Conference_Location : 
Curico
         
        
        
            Print_ISBN : 
978-1-4673-1364-3
         
        
        
            DOI : 
10.1109/SCCC.2011.29