• DocumentCode
    1296321
  • Title

    Highly scalable parallel algorithms for sparse matrix factorization

  • Author

    Gupta, Anshul ; Karypis, George ; Kumar, Vipin

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    8
  • Issue
    5
  • fYear
    1997
  • fDate
    5/1/1997 12:00:00 AM
  • Firstpage
    502
  • Lastpage
    520
  • Abstract
    In this paper, we describe scalable parallel algorithms for symmetric sparse matrix factorization, analyze their performance and scalability, and present experimental results for up to 1,024 processors on a Gray T3D parallel computer. Through our analysis and experimental results, we demonstrate that our algorithms substantially improve the state of the art in parallel direct solution of sparse linear systems-both in terms of scalability and overall performance. It is a well known fact that dense matrix factorization scales well and can be implemented efficiently on parallel computers. In this paper, we present the first algorithms to factor a wide class of sparse matrices (including those arising from two- and three-dimensional finite element problems) that are asymptotically as scalable as dense matrix factorization algorithms on a variety of parallel architectures. Our algorithms incur less communication overhead and are more scalable than any previously known parallel formulation of sparse matrix factorization. Although, in this paper, we discuss Cholesky factorization of symmetric positive definite matrices, the algorithms can be adapted for solving sparse linear least squares problems and for Gaussian elimination of diagonally dominant matrices that are almost symmetric in structure. An implementation of one of our sparse Cholesky factorization algorithms delivers up to 20 GFlops on a Gray T3D for medium-size structural engineering and linear programming problems. To the best of our knowledge, this is the highest performance ever obtained for sparse Cholesky factorization on any supercomputer
  • Keywords
    linear programming; parallel algorithms; parallel architectures; performance evaluation; sparse matrices; Cholesky factorization; Gaussian elimination; Gray T3D parallel computer; communication overhead; dense matrix factorization; diagonally dominant matrices; highly scalable parallel algorithms; linear programming; parallel architectures; parallel direct solution; performance; scalability; sparse Cholesky factorization algorithms; sparse linear least squares problems; sparse matrix factorization; symmetric positive definite matrices; Algorithm design and analysis; Concurrent computing; Finite element methods; Least squares methods; Parallel algorithms; Parallel architectures; Performance analysis; Scalability; Sparse matrices; Symmetric matrices;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/71.598277
  • Filename
    598277