Title :
Fast sparse matrix-vector multiplication on graphics processing unit for finite element analysis
Author :
Cheik Ahamed, Abal-Kassim ; Magoules, Frederic
Author_Institution :
Appl. Math. & Syst. Lab., Ecole Centrale Paris, Chatenay-Malabry, France
Abstract :
Finite element analysis involves the solution of linear systems described by large sparse matrices. Iterative Krylov methods are well suited to this type of problem. These methods require linear algebra operations, including sparse matrix-vector multiplication, which can be computationally expensive for large matrices. In this paper, we present the best way to perform these operations, in double precision, on a Graphics Processing Unit (GPU). Several linear algebra libraries are considered and compared with our own implementation. These libraries and our own implementation are then integrated within an iterative Krylov method on the GPU. Numerical experiments on a set of finite element matrices illustrate the performance, robustness and accuracy of our implementation compared with the existing libraries, and its suitability for finite element analysis. Finally, dynamic tuning of the gridification, based on the GPU architecture and the finite element matrix characteristics, is applied to speed up the sparse matrix-vector multiplication.
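For readers unfamiliar with the operation named in the abstract, the following is a minimal CPU reference sketch of sparse matrix-vector multiplication, not the authors' GPU implementation. It assumes the compressed sparse row (CSR) layout, which the abstract does not specify, and the names `csr_spmv`, `grid_size` and `threads_per_block` are hypothetical. The second function only illustrates the kind of launch-size arithmetic that tuning a GPU grid might involve under a one-thread-per-row assumption.

```python
def csr_spmv(row_ptr, col_idx, values, x):
    """Compute y = A @ x for a matrix A stored in CSR form (assumed layout)."""
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for i in range(n_rows):
        acc = 0.0
        # Non-zeros of row i occupy positions row_ptr[i] .. row_ptr[i + 1] - 1.
        for k in range(row_ptr[i], row_ptr[i + 1]):
            acc += values[k] * x[col_idx[k]]
        y[i] = acc
    return y


def grid_size(n_rows, threads_per_block=256):
    """Blocks needed if each thread handles one row (ceiling division).

    Hypothetical helper: the block size of 256 is an illustrative default,
    not a value taken from the paper.
    """
    return (n_rows + threads_per_block - 1) // threads_per_block


# Small symmetric tridiagonal matrix, typical of a 1D finite element stencil:
# [[ 2, -1,  0],
#  [-1,  2, -1],
#  [ 0, -1,  2]]
row_ptr = [0, 2, 5, 7]
col_idx = [0, 1, 0, 1, 2, 1, 2]
values = [2.0, -1.0, -1.0, 2.0, -1.0, -1.0, 2.0]
x = [1.0, 2.0, 3.0]

y = csr_spmv(row_ptr, col_idx, values, x)
print(y)
print(grid_size(len(x)))
```

On a GPU, the outer loop over rows is the part that would typically be distributed across threads, which is why the number of rows and the chosen block size together determine the grid dimensions.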
Keywords :
finite element analysis; graphics processing units; iterative methods; mathematics computing; matrix multiplication; sparse matrices; GPU; fast sparse matrix-vector multiplication; finite element matrices; graphics processing unit; gridification dynamic tuning; iterative Krylov method; large size sparse matrices; linear algebra libraries; linear algebra operations; linear systems; Finite element methods; Instruction sets; Kernel; Libraries; Symmetric matrices; dynamic tuning; gridification; linear algebra; sparse matrix-vector multiplication
Conference_Title :
2012 IEEE 14th International Conference on High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems (HPCC-ICESS)
Conference_Location :
Liverpool
Print_ISBN :
978-1-4673-2164-8
DOI :
10.1109/HPCC.2012.193