Title :
Dense linear algebra solvers for multicore with GPU accelerators
Author :
Tomov, Stanimire ; Nath, Rajib ; Ltaief, Hatem ; Dongarra, Jack
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Univ. of Tennessee, Knoxville, TN, USA
Abstract :
Solving dense linear systems of equations is a fundamental problem in scientific computing. Numerical simulations involving complex systems represented in terms of unknown variables and relations between them often lead to linear systems of equations that must be solved as fast as possible. We describe current efforts toward the development of these critical solvers in the area of dense linear algebra (DLA) for multicore with GPU accelerators. We describe how to code/develop solvers to effectively use the high computing power available in these new and emerging hybrid architectures. The approach taken is based on hybridization techniques in the context of Cholesky, LU, and QR factorizations. We use a high-level parallel programming model and leverage existing software infrastructure, e.g. optimized BLAS for CPU and GPU, and LAPACK for sequential CPU processing. Included also are architecture and algorithm-specific optimizations for standard solvers as well as mixed-precision iterative refinement solvers. The new algorithms, depending on the hardware configuration and routine parameters, can lead to orders of magnitude acceleration when compared to the same algorithms on standard multicore architectures that do not contain GPU accelerators. The newly developed DLA solvers are integrated and freely available through the MAGMA library.
Keywords :
coprocessors; linear algebra; mathematics computing; matrix decomposition; multiprocessing systems; optimisation; parallel programming; Cholesky factorization; GPU accelerators; LAPACK software; LU factorization; MAGMA library; QR factorization; algorithm-specific optimization; architecture-specific optimization; dense linear algebra solvers; graphics processing unit; hybridization techniques; multicore systems; optimized BLAS software; parallel programming model; Acceleration; Computer architecture; Equations; Iterative algorithms; Linear accelerators; Linear algebra; Linear systems; Multicore processing; Numerical simulation; Scientific computing; Dense Linear Algebra Solvers; GPU Accelerators; Hybrid Algorithms; MAGMA; Multicore;
Conference_Titel :
Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), 2010 IEEE International Symposium on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4244-6533-0
DOI :
10.1109/IPDPSW.2010.5470941