DocumentCode :
3111131
Title :
Communication Avoiding Gaussian elimination
Author :
Grigori, Laura ; Demmel, James ; Xiang, Hia
Author_Institution :
INRIA Saclay-Ile de France, Univ. Paris-Sud 11, Orsay, France
fYear :
2008
fDate :
15-21 Nov. 2008
Firstpage :
1
Lastpage :
12
Abstract :
We present CALU, a Communication Avoiding algorithm for the LU factorization of dense matrices distributed in a two-dimensional cyclic layout. The algorithm is based on a new pivoting strategy, which is stable in practice. The new algorithm is optimal (up to polylogarithmic factors) in the amount of communication it performs. Our experiments show that CALU leads to a reduction in the parallel time, in particular when the latency time is an important factor of the overall time. The factorization of a block-column, a subroutine of CALU, outperforms the corresponding routine PDGETF2 from ScaLAPACK up to a factor of 4.37 on an IBM POWER 5 system and up to a factor of 5.58 on a Cray XT4 system. On square matrices of order 104, CALU outperforms the corresponding routine PDGETRF from ScaLAPACK by a factor of 1.24 on IBM POWER 5 and by a factor of 1.31 on Cray XT4.
Keywords :
matrix decomposition; parallel algorithms; Cray XT4 system; Gaussian elimination; IBM POWER 5 system; PDGETF2; ScaLAPACK; block-column factorization; communication avoiding algorithm; dense matrices factorization; latency time; two-dimensional cyclic layout; Algorithms; Delay;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis, 2008. SC 2008. International Conference for
Conference_Location :
Austin, TX
Print_ISBN :
978-1-4244-2834-2
Electronic_ISBN :
978-1-4244-2835-9
Type :
conf
DOI :
10.1109/SC.2008.5214287
Filename :
5214287
Link To Document :
بازگشت