DocumentCode
426872
Title
Sparse LU Factorization with Partial Pivoting on Distributed Memory Machines
Author
Fu, Cong ; Yang, Tao
Author_Institution
University of California, Santa Barbara
fYear
1996
fDate
1996
Firstpage
31
Lastpage
31
Abstract
Sparse LU factorization with partial pivoting is important to many scientific applications, but the effective parallelization of this algorithm is still an open problem. The main difficulty is that partial pivoting operations make structures of L and U factors unpredictable beforehand. This paper presents a novel approach called S* for parallelizing this problem on distributed memory machines. S* incorporates static symbolic factorization to avoid run-time control overhead and uses nonsymmetric L/U supernode partitioning and amalgamation strategies to maximize the use of BLAS-3 routines. The irregular task parallelism embedded in sparse LU is exploited using the RAPID run-time system which optimizes asynchronous communication and task scheduling. The experimental results on the Cray0-T3D with a set of Harwell-Boeing nonsymmetric matrices are very encouraging and good scalability has been achieved. Even compared to a highly optimized sequential code, the parallel speedups are still impressive considering the current status of sparse LU research.
Keywords
Column partial pivoting; Dense structures; Irregular parallelism; Run-time support; Sparse LU factorization; Symbolic factorization; Task scheduling; Asynchronous communication; Communication system control; Runtime; Scalability; Sparse matrices; Column partial pivoting; Dense structures; Irregular parallelism; Run-time support; Sparse LU factorization; Symbolic factorization; Task scheduling;
fLanguage
English
Publisher
ieee
Conference_Titel
Supercomputing, 1996. Proceedings of the 1996 ACM/IEEE Conference on
Print_ISBN
0-89791-854-1
Type
conf
DOI
10.1109/SUPERC.1996.183533
Filename
1392902
Link To Document