DocumentCode :
3266111
Title :
Optimizing sparse matrix-vector multiplication on CUDA
Author :
Wang, Zhuowei ; Xu, Xianbin ; Zhao, Wuqing ; Zhang, Yuping ; He, Shuibing
Author_Institution :
Sch. of Comput., Wuhan Univ., Wuhan, China
Volume :
4
fYear :
2010
fDate :
22-24 June 2010
Abstract :
In recent years, GPUs have attracted the attention of many application developers as powerful massively parallel system. CUDA as a general purpose parallel computing architecture make GPUs an appealing choice to solve many complex computational problems in a more efficient way. In this paper, we discuss implementing optimizing spare matrix-vector multiplication on NVIDIA GPUs using CUDA programming model. We outline three optimizations include: (1) optimized CSR storage format, (2) optimized threads mapping, and (3) avoiding divergence judgment. We experimentally evaluate our optimizations on GeForce 9600 GTX, connect to Windows xp 64-bit system. In comparison with NVIDIA´s SpMV library and NVIDIA´s CUDDPA library, the results show that optimizing sparse matrix-vector multiplication on CUDA achieves better performance than other SpMV implementations.
Keywords :
microprocessor chips; parallel processing; performance evaluation; CUDA programming model; GeForce 9600 GTX; NVIDIA GPU; Windows xp 64-bit system; complex computational problem; general purpose parallel computing architecture; optimized CSR storage format; optimized thread mapping; powerful massively parallel system; sparse matrix-vector multiplication; Application software; Computer science education; Concurrent computing; Educational institutions; Graphics processing unit; Kernel; Libraries; Parallel processing; Parallel programming; Sparse matrices; Yarn; CUDA; GPUs; NVIDIA´s CUDDPA library; NVIDIA´s SpMV library; SpMV;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Education Technology and Computer (ICETC), 2010 2nd International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-6367-1
Type :
conf
DOI :
10.1109/ICETC.2010.5529724
Filename :
5529724
Link To Document :
بازگشت