• DocumentCode
    2002355
  • Title

    Matrix-vector multiplication and triangular linear solver using GPGPU for symmetric positive definite matrices derived from elliptic equations

  • Author

    de Castro Martins, T. ; de Miranda Kian, J. ; Kubagawa Sato, Andre ; de Sales Guerra Tsuzuki, Marcos

  • Author_Institution
    Dept. de Eng. Mecatronica e de Sist. Mecanicos, Univ. de Sao Paulo, Sao Paulo, Brazil
  • fYear
    2012
  • fDate
    20-24 Nov. 2012
  • Firstpage
    1286
  • Lastpage
    1291
  • Abstract
    The modern GPUs are well suited for intensive computational tasks and massive parallel computation. Sparse matrix multiplication and linear triangular solver are the most important and heavily used kernels in scientific computation, and several challenges in developing a high performance kernel with the two modules is investigated. The main interest it to solve linear systems derived from the elliptic equations with triangular elements. The resulting linear system has a symmetric positive definite matrix. The sparse matrix is stored in the compressed sparse row (CSR) format. It is proposed a CUDA algorithm to execute the matrix vector multiplication using directly the CSR format. A dependence tree algorithm is used to determine which variables the linear triangular solver can determine in parallel. To increase the number of the parallel threads, a coloring graph algorithm is implemented to reorder the mesh numbering in a pre-processing phase. The proposed method is compared with parallel and serial available libraries. The results show that the proposed method improves the computation cost of the matrix vector multiplication. The pre-processing associated with the triangular solver needs to be executed just once in the proposed method. The conjugate gradient method was implemented and showed similar convergence rate for all the compared methods. The proposed method showed significant smaller execution time.
  • Keywords
    conjugate gradient methods; graph colouring; graphics processing units; mathematics computing; matrix multiplication; parallel processing; trees (mathematics); vectors; CSR format; CUDA algorithm; GPGPU; coloring graph algorithm; compressed sparse row format; conjugate gradient method; dependence tree algorithm; elliptic equation; general-purpose graphics processing unit; matrix vector multiplication; matrix-vector multiplication; mesh numbering; parallel computation; parallel library; parallel thread; serial library; symmetric positive definite matrix; triangular linear solver; GPGPU; Massive Parallelization; Matrix-Vector Multiplication; Sparse Matrix;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Soft Computing and Intelligent Systems (SCIS) and 13th International Symposium on Advanced Intelligent Systems (ISIS), 2012 Joint 6th International Conference on
  • Conference_Location
    Kobe
  • Print_ISBN
    978-1-4673-2742-8
  • Type

    conf

  • DOI
    10.1109/SCIS-ISIS.2012.6505077
  • Filename
    6505077