Implementing the Jacobi Algorithm for Solving Eigenvalues of Symmetric Matrices with CUDA

Author

Tao Wang ; Longjiang Guo ; Guilin Li ; Jinbao Li ; Renda Wang ; Meirui Ren ; Jing He

Author_Institution

Sch. of Comput. Sci. & Technol., Heilongjiang Univ., Harbin, China

fYear

2012

fDate

28-30 June 2012

Firstpage

69

Lastpage

78

Abstract

Solving the eigenvalues of matrices is an open problem which is often related to scientific computation. With the increasing of the order of matrices, traditional sequential algorithms are unable to meet the needs for the calculation time. Although people can use cluster systems in a short time to solve the eigenvalues of large-scale matrices, it will bring an increase in equipment costs and power consumption. This paper proposes a parallel algorithm named Jacobi on gpu which is implemented by CUDA (Computer Unified Device Architecture) on GPU (Graphic Process Unit) to solve the eigenvalues of symmetric matrices. In our experimental environment, we have Intel Core i5-760 quad-core CPU, NVIDIA GeForce GTX460 card, and Win7 64-bit operating system. When the size of matrix is 10240×10240, the number of iterations is 10000 times, the speedup ratio is 13.71. As the size of matrices increase, the speedup ratio increases correspondingly. Moreover, as the number of iterations increases, the speedup ratio is very stable. When the size of matrix is 8192×8192, the number of iterations are 1000, 2000, 4000, 8000 and 16000 respectively, the standard deviation of the speedup ratio is 0.1161. The experimental results show that the Jacobi on gpu algorithm can save more running time than traditional sequential algorithms and the speedup ratio is 3.02~13.71. Therefore, the computing time of traditional sequential algorithms to solve the eigenvalues of matrices is reduced significantly.

Keywords

Jacobian matrices; eigenvalues and eigenfunctions; graphics processing units; mathematics computing; parallel algorithms; parallel architectures; pattern clustering; CUDA; Intel Core i5-760 quad-core CPU; Jacobi algorithm; NVIDIA GeForce GTX460 card; Win7 64-bit operating system; cluster systems; computer unified device architecture; eigenvalues; graphic process unit; large-scale matrices; parallel algorithm; sequential algorithms; symmetric matrices; Arrays; Eigenvalues and eigenfunctions; Graphics processing unit; Indexes; Jacobian matrices; Kernel; Symmetric matrices; CUDA; GPU; Jacobi iteration; Matrix eigenvalue; Symmetric Matrix;

fLanguage

English

Publisher

ieee

Conference_Titel

Networking, Architecture and Storage (NAS), 2012 IEEE 7th International Conference on

Conference_Location

Xiamen, Fujian

Print_ISBN

978-1-4673-1889-1

Type

conf

DOI

10.1109/NAS.2012.12

Filename

6310877