Title :
Performance of Parallel Sparse Matrix-Vector Multiplications in Linear Solves on Multiple GPUs
Author :
Jamroz, Ben ; Mullowney, Paul
Author_Institution :
Tech-X Corp., Boulder, CO, USA
Abstract :
Modern numerical simulations often require solving extremely large sparse linear systems. Solving these linear systems using Krylov iterative methods requires repeated sparse matrix-vector multiplications which can be the most computationally expensive part of the simulation. Since Graphics Processing Units (GPUs) provide a significant increase in floating point operations per second and memory bandwidth over conventional Central Processing Units (CPUs), performing sparse matrix-vector multiplications with these co-processors can decrease the amount of time required to solve a given linear system. In this paper, we investigate the performance of sparse matrix-vector multiplications across multiple GPUs. This is performed in the context of the solution of symmetric positive-definite linear systems using a conjugate-gradient iteration preconditioned with a least-squares polynomial preconditioner using the PETSc library.
Keywords :
gradient methods; graphics processing units; least squares approximations; sparse matrices; CPU; Krylov iterative methods; PETSc library; central processing units; conjugate-gradient iteration; coprocessors; floating point operations; graphics processing units; least-squares polynomial preconditioner; memory bandwidth; multiple GPU; numerical simulations; parallel sparse matrix-vector multiplications; sparse linear systems; symmetric positive-definite linear systems; Approximation methods; Graphics processing unit; Linear systems; Performance evaluation; Polynomials; Sparse matrices; Vectors; graphics processing units; linear algebra; preconditioner;
Conference_Titel :
Application Accelerators in High Performance Computing (SAAHPC), 2012 Symposium on
Conference_Location :
Chicago IL
Print_ISBN :
978-1-4673-2882-1
DOI :
10.1109/SAAHPC.2012.27