DocumentCode :
1295766
Title :
Algorithmic redistribution methods for block-cyclic decompositions
Author :
Petitet, Antoine P. ; Dongarra, Jack J.
Author_Institution :
Dept. of Comput. Sci., Tennessee Univ., Knoxville, TN, USA
Volume :
10
Issue :
12
fYear :
1999
fDate :
12/1/1999 12:00:00 AM
Firstpage :
1201
Lastpage :
1216
Abstract :
This article presents various data redistribution methods for block-partitioned linear algebra algorithms operating on dense matrices that are distributed in a block-cyclic fashion. Because the algorithmic partitioning unit and the distribution blacking factor are most often chosen to be equal, severe alignment restrictions are induced on the operands, and optimal values with respect to performance are architecture dependent. The techniques presented in this paper redistribute data “on the fly,” so that the user´s data distribution blocking factor becomes independent from the architecture dependent algorithmic partitioning. These techniques are applied to the matrix-matrix multiplication operation. A performance analysis along with experimental results shows that alignment restrictions can then be removed and that high performance can be maintained across platforms independently from the user´s data distribution blocking factor
Keywords :
matrix decomposition; parallel algorithms; alignment restrictions; block-partitioned linear algebra; data redistribution; dense matrices; matrix-matrix multiplication; Algorithm design and analysis; Concurrent computing; Distributed computing; Linear algebra; Matrix decomposition; Parallel algorithms; Partitioning algorithms; Performance analysis; Scalability; Software libraries;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/71.819944
Filename :
819944
Link To Document :
بازگشت