DocumentCode :
3092027
Title :
The improved CGS method for large and sparse linear systems on bulk synchronous parallel architectures
Author :
Yang, Laurence Tianruo
Author_Institution :
Dept. of Comput. Sci., Saint Francis Xavier Univ., Antigonish, NS, Canada
fYear :
2002
fDate :
23-25 Oct. 2002
Firstpage :
232
Lastpage :
237
Abstract :
We propose an improved version of the conjugate gradient squared (CGS) method for the solutions of large and sparse linear systems of equations with unsymmetric coefficient matrices. The proposed method combines elements of numerical stability and parallel algorithm design without increasing computational costs. The algorithm is derived such that all matrix-vector multiplication, inner products and vector updates of a single iteration step are independent and the communication time required for inner product can be overlapped efficiently with the computation time of vector updates. Therefore, the cost of global communication which represents the bottleneck of the performance can be significantly reduced. In this paper, the bulk synchronous parallel model is used to design a fully efficient, scalable and portable parallel proposed algorithm and to provide accurate performance prediction of the algorithm for a wide range of architectures including the Cray T3D, the Parsytec, and a cluster of workstations connected by an Ethernet. This performance model uses only a few system dependent parameters based on a simple and accurate cost modelling to provide useful insight in the time complexity of the method. The theoretical performance prediction are compared with some preliminary measured timing results of a numerical application from the ocean flow simulation.
Keywords :
computational complexity; conjugate gradient methods; geophysics computing; iterative methods; linear systems; mathematics computing; parallel algorithms; parallel architectures; bulk-synchronous parallel model; complexity analysis; conjugate gradient squared method; iterative methods; large linear systems; ocean flow simulation; parallel algorithm; parallel architectures; unsymmetric coefficient matrices; Algorithm design and analysis; Clustering algorithms; Costs; Equations; Linear systems; Numerical stability; Parallel algorithms; Parallel architectures; Predictive models; Sparse matrices;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Algorithms and Architectures for Parallel Processing, 2002. Proceedings. Fifth International Conference on
Conference_Location :
Beijing, China
Print_ISBN :
0-7695-1512-6
Type :
conf
DOI :
10.1109/ICAPP.2002.1173579
Filename :
1173579
Link To Document :
بازگشت