Title :
LU Decomposition on GPUs: The Impact of Memory Access
Author :
Cupertino, Leandro F. ; Singulani, Anderson P. ; da Silva, C.P. ; Pacheco, Marco Aurélio C. ; Farias, Ricardo
Author_Institution :
Dept. of Electr. Eng., PUC-Rio, Rio de Janeiro, Brazil
Abstract :
Graphics Processing Units (GPUs) are emerging as an attractive platform for general-purpose computation because of their extremely high floating-point performance and comparatively low cost. In dense linear algebra, the LU decomposition is a fundamental step in many computationally intensive scientific applications. Running it on a GPU can make the computation many times faster than on a single CPU. In this work, we investigate different implementations of the LU decomposition algorithm on a GPU. Our main goals are to parallelize the LU decomposition to fit the highly parallel architecture of modern GPUs and to evaluate how different types of memory access affect the execution time of the algorithm. The results demonstrate that the memory access pattern can significantly affect the performance of the GPU implementation.
Keywords :
computer graphic equipment; coprocessors; floating point arithmetic; linear algebra; parallel architectures; GPU; LU decomposition; floating point processing; graphics processing unit; memory access; Equations; Indexes; Instruction sets; Kernel; Mathematical model; Matrix decomposition
Conference_Title :
2010 22nd International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)
Conference_Location :
Petropolis
Print_ISBN :
978-1-4244-8877-3
Electronic_ISBN :
978-0-7695-4276-8
DOI :
10.1109/SBAC-PADW.2010.10