Title :
Portable parallel Level-3 BLAS in Linda
Author :
Ghosh, Bhaskar ; Schultz, Martin H.
Author_Institution :
Dept. of Comput. Sci., Yale Univ., New Haven, CT, USA
Abstract :
Describes an approach towards providing an efficient Level-3 BLAS library over a variety of parallel architectures using C-Linda. A blocked linear algebra program calling the sequential Level-3 BLAS can now run on both shared and distributed memory environments (which support Linda) by simply replacing each call by a call to the corresponding parallel Linda Level-3 BLAS. The authors summarise some of the implementation and algorithmic issues related to the matrix multiplication subroutine. All the various matrix algorithms being block-structured, they are particularly interested in parallel computers with hierarchical memory systems. Experimental data for their implementations show substantial speedups on shared memory, disjoint memory and networked configurations of processors. The authors also present the use of their parallel subroutines in blocked dense LU decomposition and present some preliminary experimental data
Keywords :
linear algebra; parallel algorithms; software portability; Basin Linear Algebra Subroutines; C-Linda; Level-3 BLAS library; algorithmic issues; blocked dense LU decomposition; blocked linear algebra program; disjoint memory; distributed memory environments; hierarchical memory systems; matrix multiplication subroutine; networked configurations; parallel architectures; portable parallel software; shared memory environments; speedups; Algorithms; Computer architecture; Computer science; Concurrent computing; Libraries; Linear algebra; Matrix decomposition; Parallel machines; Portable computers; Scientific computing;
Conference_Titel :
Scalable High Performance Computing Conference, 1992. SHPCC-92, Proceedings.
Conference_Location :
Williamsburg, VA
Print_ISBN :
0-8186-2775-1
DOI :
10.1109/SHPCC.1992.232664