Title :
Tuning matrix-vector multiplication on GPU
Author :
Dziekonski, Adam ; Mrozowski, Michal
Author_Institution :
Dept. of Microwave & Antenna Eng., Gdansk Univ. of Technol., Gdansk, Poland
Abstract :
A matrix times vector multiplication (matvec) is a cornerstone operation in iterative methods of solving large sparse systems of equations such as the conjugate gradients method (cg), the minimal residual method (minres), the generalized residual method (gmres) and exerts an influence on overall performance of those methods. An implementation of matvec is particularly demanding when one executes computations on a GPU (Graphics Processing Unit), because using this device one has to comply with certain programming rules in order to take advantage of parallel computing. In this paper, it will be shown how to modify the sparse matrix-vector multiplication based on CRS (Compressed Row Storage) to achieve about 3-5 times better performance on - a low cost - GPU (GeForce GTX 285, 1.48 GHz) than on a CPU (Intel Core i7, 2.67 GHz).
Keywords :
computer graphic equipment; coprocessors; iterative methods; matrix multiplication; sparse matrices; vectors; CRS; GPU; compressed row storage; conjugate gradients method; cornerstone operation; generalized residual method; graphics processing unit; iterative methods; large sparse systems; matrix-vector multiplication; minimal residual method; parallel computing; programming rules; Acceleration; Computer architecture; Graphics; Graphics processing unit; Instruction sets; Kernel; CUDA; GPU; Sparse Matrix times Vector multiplication (SpMV);
Conference_Titel :
Information Technology (ICIT), 2010 2nd International Conference on
Conference_Location :
Gdansk
Print_ISBN :
978-1-4244-8182-8