Title :
GPU-accelerated scalable solver for banded linear systems
Author :
Hang Liu ; Jung-Hee Seo ; Mittal, Riya ; Huang, He Helen
Author_Institution :
George Washington Univ., Washington, DC, USA
Abstract :
Solving a banded linear system efficiently is important to many scientific and engineering applications. Current solvers achieve good scalability only on the linear systems that can be partitioned into independent subsystems. In this paper, we present a GPU based, scalable Bi-Conjugate Gradient Stabilized solver that can be used to solve a wide range of banded linear systems. We utilize a row-oriented matrix decomposition method to divide the banded linear system into several correlated sub-linear systems and solve them on multiple GPUs collaboratively. We design a number of GPU and MPI optimizations to speedup inter-GPU and inter-machine communications. We evaluate the solver on Poisson equation and advection diffusion equation as well as several other banded linear systems. The solver achieves a speedup of more than 21 times running from 6 to 192 GPUs on the XSEDE´s Keeneland supercomputer and because of small communication overhead, can scale upto 32 GPUs on Amazon EC2 with relatively slow ethernet network.
Keywords :
Poisson equation; application program interfaces; graphics processing units; linear systems; local area networks; system buses; Ethernet network; GPU accelerated scalable solver; MPI optimizations; Poisson equation; XSEDE´s Keeneland supercomputer; advection diffusion equation; banded linear systems; independent subsystems; intermachine communications; row oriented matrix decomposition method; scalable biconjugate gradient stabilized solver; sublinear systems; Computers; Correlation; Data transfer; Educational institutions; Graphics processing units; Matrix decomposition; Scalability;
Conference_Titel :
Cluster Computing (CLUSTER), 2013 IEEE International Conference on
Conference_Location :
Indianapolis, IN
DOI :
10.1109/CLUSTER.2013.6702612