Scalable linear algebra software libraries for distributed memory concurrent computers

Author

Choi, Jaeyoung ; Dongarra, Jack J.

Author_Institution

Sch. of Comput., Soongsil Univ., Seoul, South Korea

fYear

1995

fDate

28-30 Aug 1995

Firstpage

170

Lastpage

177

Abstract

This paper discusses the core factorization routines included in the ScaLAPACK library. These routines allow the factorization and solution of a dense system of linear equations via LU, QR, and Cholesky. They are implemented using a block cyclic data distribution, and are built using de facto standard kernels for matrix and vector operations (BLAS and its parallel counterpart PBLAS) and message passing communication (BLACS). In implementing the ScaLAPACK routines, a major objective was to parallelize the corresponding sequential LAPACK using the BLAS, BLACS, and PBLAS as building blocks, leading to straightforward parallel implementations without a significant loss in performance. This paper discusses the design of ScaLAPACK a scalable software library for performing dense and banded linear algebra computations on distributed memory concurrent computers. They are implemented using a block cyclic data distribution, and are built using de facto standard kernels for matrix and vector operations (BLAS and its parallel counterpart PBLAS) and message passing communication (BLACS). In implementing the ScaLAPACK routines, a major objective was to parallelize the corresponding sequential LAPACK using the BLAS, BLACS, and PBLAS as building blocks, leading to straightforward parallel implementations without a significant loss in performance. We present the details of the implementation of the ScaLAPACK LU factorization routine, and performance and scalability results on the Intel iPSC/860, Touchstone Delta, and Paragon systems

Keywords

message passing; parallel programming; software libraries; software reusability; symbol manipulation; Intel iPSC/860; Paragon systems; ScaLAPACK library; Touchstone Delta; block cyclic data distribution; core factorization routines; de facto standard kernels; distributed memory concurrent computers; matrix operations; message passing communication; parallel implementations; scalable linear algebra software libraries; scalable software library; vector operations; Communication standards; Concurrent computing; Distributed computing; Equations; Kernel; Linear algebra; Message passing; Performance loss; Software libraries; Vectors;

fLanguage

English

Publisher

ieee

Conference_Titel

Distributed Computing Systems, 1995., Proceedings of the Fifth IEEE Computer Society Workshop on Future Trends of

Conference_Location

Cheju Island

Print_ISBN

0-8186-7125-4

Type

conf

DOI

10.1109/FTDCS.1995.524982

Filename

524982