DocumentCode :
2101968
Title :
GPU-based multifrontal optimizing method in sparse Cholesky factorization
Author :
Zheng, Ran ; Wang, Wei ; Jin, Hai ; Wu, Song ; Chen, Yong ; Jiang, Han
Author_Institution :
Services Computing Technology and System Lab, Cluster and Grid Computing Lab, School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, China
fYear :
2015
fDate :
27-29 July 2015
Firstpage :
90
Lastpage :
97
Abstract :
In many scientific computing applications, sparse Cholesky factorization is used to solve large sparse linear equations in distributed environment. GPU computing is a new way to solve the problem. However, sparse Cholesky factorization on GPU is hardly to achieve excellent performance due to the structure irregularity of matrix and the low GPU resource utilization. A hybrid CPU-GPU implementation of sparse Cholesky factorization is proposed based on multifrontal method. A large sparse coefficient matrix is decomposed into a series of small dense matrices (frontal matrices) in the method, and then multiple GEMM (General Matrix-matrix Multiplication) operations are computed. GEMMs are the main operations in sparse Cholesky factorization, but they are hardly to perform better in parallel on GPU. In order to improve the performance, the scheme of multiple task queues is adopted when performing multiple GEMMs parallelized with multifrontal method; all GEMM tasks are scheduled dynamically on GPU and CPU based on computation scales for load balance and computing-time reduction. Experimental results show that the approach can outperform the implementations of BLAS and cuBLAS, achieving up to 3.15× and 1.98× speedup, respectively.
Keywords :
Acceleration; Graphics processing units; Instruction sets; Kernel; Matrix decomposition; Sparse matrices; Symmetric matrices; GPU Acceleration; Multifrontal method; Multiple task queues scheme; Task allocation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Application-specific Systems, Architectures and Processors (ASAP), 2015 IEEE 26th International Conference on
Conference_Location :
Toronto, ON, Canada
Type :
conf
DOI :
10.1109/ASAP.2015.7245714
Filename :
7245714
Link To Document :
بازگشت