Title :
Scalable linear algebra software libraries for distributed memory concurrent computers
Author :
Choi, Jaeyoung ; Dongarra, Jack J.
Author_Institution :
Sch. of Comput., Soongsil Univ., Seoul, South Korea
Abstract :
This paper discusses the core factorization routines included in the ScaLAPACK library. These routines allow the factorization and solution of a dense system of linear equations via LU, QR, and Cholesky. They are implemented using a block cyclic data distribution, and are built using de facto standard kernels for matrix and vector operations (BLAS and its parallel counterpart PBLAS) and message passing communication (BLACS). In implementing the ScaLAPACK routines, a major objective was to parallelize the corresponding sequential LAPACK using the BLAS, BLACS, and PBLAS as building blocks, leading to straightforward parallel implementations without a significant loss in performance. This paper discusses the design of ScaLAPACK a scalable software library for performing dense and banded linear algebra computations on distributed memory concurrent computers. They are implemented using a block cyclic data distribution, and are built using de facto standard kernels for matrix and vector operations (BLAS and its parallel counterpart PBLAS) and message passing communication (BLACS). In implementing the ScaLAPACK routines, a major objective was to parallelize the corresponding sequential LAPACK using the BLAS, BLACS, and PBLAS as building blocks, leading to straightforward parallel implementations without a significant loss in performance. We present the details of the implementation of the ScaLAPACK LU factorization routine, and performance and scalability results on the Intel iPSC/860, Touchstone Delta, and Paragon systems
Keywords :
message passing; parallel programming; software libraries; software reusability; symbol manipulation; Intel iPSC/860; Paragon systems; ScaLAPACK library; Touchstone Delta; block cyclic data distribution; core factorization routines; de facto standard kernels; distributed memory concurrent computers; matrix operations; message passing communication; parallel implementations; scalable linear algebra software libraries; scalable software library; vector operations; Communication standards; Concurrent computing; Distributed computing; Equations; Kernel; Linear algebra; Message passing; Performance loss; Software libraries; Vectors;
Conference_Titel :
Distributed Computing Systems, 1995., Proceedings of the Fifth IEEE Computer Society Workshop on Future Trends of
Conference_Location :
Cheju Island
Print_ISBN :
0-8186-7125-4
DOI :
10.1109/FTDCS.1995.524982