Title :
Performance and scalability of preconditioned conjugate gradient methods on parallel computers
Author :
Gupta, Anshul ; Kumar, Vipin ; Sameh, Ahmed
Author_Institution :
Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA
fDate :
5/1/1995 12:00:00 AM
Abstract :
This paper analyzes the performance and scalability of an iteration of the preconditioned conjugate gradient algorithm on parallel architectures with a variety of interconnection networks, such as the mesh, the hypercube, and that of the CM-5 parallel computer. It is shown that for block-tridiagonal matrices resulting from two-dimensional finite difference grids, the communication overhead due to vector inner products dominates the communication overheads of the remainder of the computation on a large number of processors. However, with a suitable mapping, the parallel formulation of a PCG iteration is highly scalable for such matrices on a machine like the CM-5 whose fast control network practically eliminates the overheads due to inner product computation. The use of the truncated Incomplete Cholesky (IC) preconditioner can lead to further improvement in scalability on the CM-5 by a constant factor,as a result, a parallel formulation of the PCG algorithm with IC preconditioner may execute faster than that with a simple diagonal preconditioner even if the latter runs faster in a serial implementation
Keywords :
computational complexity; conjugate gradient methods; parallel algorithms; parallel architectures; sparse matrices; PCG iteration; block-tridiagonal matrices; interconnection networks; iteration; parallel architectures; parallel computers; parallel formulation; preconditioned conjugate gradient methods; scalability; scalable; Algorithm design and analysis; Computer networks; Concurrent computing; Finite difference methods; Grid computing; Hypercubes; Multiprocessor interconnection networks; Parallel architectures; Performance analysis; Scalability;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on