DocumentCode
3383247
Title
Scalable linear algebra software libraries for distributed memory concurrent computers
Author
Choi, Jaeyoung ; Dongarra, Jack J.
Author_Institution
Sch. of Comput., Soongsil Univ., Seoul, South Korea
fYear
1995
fDate
28-30 Aug 1995
Firstpage
170
Lastpage
177
Abstract
This paper discusses the core factorization routines included in the ScaLAPACK library. These routines allow the factorization and solution of a dense system of linear equations via LU, QR, and Cholesky. They are implemented using a block cyclic data distribution, and are built using de facto standard kernels for matrix and vector operations (BLAS and its parallel counterpart PBLAS) and message passing communication (BLACS). In implementing the ScaLAPACK routines, a major objective was to parallelize the corresponding sequential LAPACK using the BLAS, BLACS, and PBLAS as building blocks, leading to straightforward parallel implementations without a significant loss in performance. This paper discusses the design of ScaLAPACK a scalable software library for performing dense and banded linear algebra computations on distributed memory concurrent computers. They are implemented using a block cyclic data distribution, and are built using de facto standard kernels for matrix and vector operations (BLAS and its parallel counterpart PBLAS) and message passing communication (BLACS). In implementing the ScaLAPACK routines, a major objective was to parallelize the corresponding sequential LAPACK using the BLAS, BLACS, and PBLAS as building blocks, leading to straightforward parallel implementations without a significant loss in performance. We present the details of the implementation of the ScaLAPACK LU factorization routine, and performance and scalability results on the Intel iPSC/860, Touchstone Delta, and Paragon systems
Keywords
message passing; parallel programming; software libraries; software reusability; symbol manipulation; Intel iPSC/860; Paragon systems; ScaLAPACK library; Touchstone Delta; block cyclic data distribution; core factorization routines; de facto standard kernels; distributed memory concurrent computers; matrix operations; message passing communication; parallel implementations; scalable linear algebra software libraries; scalable software library; vector operations; Communication standards; Concurrent computing; Distributed computing; Equations; Kernel; Linear algebra; Message passing; Performance loss; Software libraries; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Distributed Computing Systems, 1995., Proceedings of the Fifth IEEE Computer Society Workshop on Future Trends of
Conference_Location
Cheju Island
Print_ISBN
0-8186-7125-4
Type
conf
DOI
10.1109/FTDCS.1995.524982
Filename
524982
Link To Document