مرکز منطقه ای اطلاع رساني علوم و فناوري - Performance and scalability of preconditioned conjugate gradient methods on parallel computers

DocumentCode :

775025

Title :

Performance and scalability of preconditioned conjugate gradient methods on parallel computers

Author :

Gupta, Anshul ; Kumar, Vipin ; Sameh, Ahmed

Author_Institution :

Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA

Volume :

Issue :

fYear :

1995

fDate :

5/1/1995 12:00:00 AM

Firstpage :

455

Lastpage :

469

Abstract :

This paper analyzes the performance and scalability of an iteration of the preconditioned conjugate gradient algorithm on parallel architectures with a variety of interconnection networks, such as the mesh, the hypercube, and that of the CM-5 parallel computer. It is shown that for block-tridiagonal matrices resulting from two-dimensional finite difference grids, the communication overhead due to vector inner products dominates the communication overheads of the remainder of the computation on a large number of processors. However, with a suitable mapping, the parallel formulation of a PCG iteration is highly scalable for such matrices on a machine like the CM-5 whose fast control network practically eliminates the overheads due to inner product computation. The use of the truncated Incomplete Cholesky (IC) preconditioner can lead to further improvement in scalability on the CM-5 by a constant factor,as a result, a parallel formulation of the PCG algorithm with IC preconditioner may execute faster than that with a simple diagonal preconditioner even if the latter runs faster in a serial implementation

Keywords :

computational complexity; conjugate gradient methods; parallel algorithms; parallel architectures; sparse matrices; PCG iteration; block-tridiagonal matrices; interconnection networks; iteration; parallel architectures; parallel computers; parallel formulation; preconditioned conjugate gradient methods; scalability; scalable; Algorithm design and analysis; Computer networks; Concurrent computing; Finite difference methods; Grid computing; Hypercubes; Multiprocessor interconnection networks; Parallel architectures; Performance analysis; Scalability;

fLanguage :

English

Journal_Title :

Parallel and Distributed Systems, IEEE Transactions on

Publisher :

ieee

ISSN :

1045-9219

Type :

jour

DOI :

10.1109/71.382315

Filename :

382315

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=775025