DocumentCode
3410364
Title
An LU factorization algorithm for parallel supercomputers with memory hierarchies
Author
Seo, Yoshiki ; Shiroto, Yukimasa ; Nishi, Naoki ; Nakazaki, Ryousei
Author_Institution
NEC Corp., Kanagawa, Japan
fYear
1991
fDate
9-10 May 1991
Firstpage
17
Abstract
A parallel algorithm for solving LU factorization of huge dense matrices was developed for parallel vector supercomputers with a hierarchy of memory layers (i.e., local memories, shared memory, semiconductor extended storage, and magnetic disk). The algorithm is based on Gaussian elimination and optimizes data transfers among memory layers by recursively using a block partitioning method. Using four memory layers, an LU factorization for a 32768×32768 dense matrix was calculated in 640 min on the HPP-LHS supercomputer system developed under the MITI (Ministry of International Trade and Industry) Supercomputer Project. Required memory capacity for the gigantic matrix is 8 GB, and the whole matrix data area was allocated to magnetic disk for this calculation. The execution speed with four processors was 2.8 times faster than that with one processor, even using a magnetic disk, and the algorithm was proved to be effective
Keywords
matrix algebra; parallel algorithms; parallel machines; Gaussian elimination; HPP-LHS supercomputer system; LU factorization algorithm; execution speed; gigantic matrix; memory hierarchies; parallel vector supercomputers; Algorithm design and analysis; International trade; Laboratories; Large-scale systems; Magnetic semiconductors; National electric code; Optimization methods; Parallel algorithms; Partitioning algorithms; Supercomputers;
fLanguage
English
Publisher
ieee
Conference_Titel
Communications, Computers and Signal Processing, 1991., IEEE Pacific Rim Conference on
Conference_Location
Victoria, BC
Print_ISBN
0-87942-638-1
Type
conf
DOI
10.1109/PACRIM.1991.160670
Filename
160670
Link To Document