• DocumentCode
    688203
  • Title

    A Comparative Study of Preconditioners for GPU-Accelerated Conjugate Gradient Solver

  • Author

    Yao Chen ; Yonghua Zhao ; Wei Zhao ; Lian Zhao

  • Author_Institution
    Comput. Network Inf. Center, Univ. of Chinese Acad. of Sci., Beijing, China
  • fYear
    2013
  • fDate
    13-15 Nov. 2013
  • Firstpage
    628
  • Lastpage
    635
  • Abstract
    We compare two types of preconditioners for GPU-Accelerated conjugate gradient solver. For the standard IC preconditioner, we exploit level scheduling to increase multi-thread parallelism of sparse triangular solve on GPU. Meanwhile, we propose a novel reordering technique to maximize the coalescing of global memory accesses. For the approximate inverse preconditioner SSOR-AI, we extend it to second order approximation. Experiments indicate that our IC PCG runs 25% faster than using vendor implementation in CUSPARSE library and SSOR-AI PCG can be twice as fast as IC PCG.
  • Keywords
    graphics processing units; multi-threading; scheduling; CUSPARSE library; GPU-accelerated conjugate gradient solver; IC PCG; SSOR-AI PCG; global memory accesses; inverse preconditioner SSOR-AI; level scheduling; multithread parallelism; reordering technique; second order approximation; standard IC preconditioner; Approximation methods; Graphics processing units; Integrated circuits; Linear systems; Optimization; Parallel processing; Sparse matrices; Approximate Inverse; Conjugate Gradient; GPU; Incomplete Cholesky; Preconditioner;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
  • Conference_Location
    Zhangjiajie
  • Type

    conf

  • DOI
    10.1109/HPCC.and.EUC.2013.94
  • Filename
    6831976