• DocumentCode
    1467304
  • Title

    Cyclic Reduction Tridiagonal Solvers on GPUs Applied to Mixed-Precision Multigrid

  • Author

    Göddeke, Dominik ; Strzodka, Robert

  • Author_Institution
    Dept. of Appl. Math., Tech. Univ. Dortmund, Dortmund, Germany
  • Volume
    22
  • Issue
    1
  • fYear
    2011
  • Firstpage
    22
  • Lastpage
    32
  • Abstract
    We have previously suggested mixed precision iterative solvers specifically tailored to the iterative solution of sparse linear equation systems as they typically arise in the finite element discretization of partial differential equations. These schemes have been evaluated for a number of hardware platforms, in particular, single-precision GPUs as accelerators to the general purpose CPU. This paper reevaluates the situation with new mixed precision solvers that run entirely on the GPU: We demonstrate that mixed precision schemes constitute a significant performance gain over native double precision. Moreover, we present a new implementation of cyclic reduction for the parallel solution of tridiagonal systems and employ this scheme as a line relaxation smoother in our GPU-based multigrid solver. With an alternating direction implicit variant of this advanced smoother, we can extend the applicability of the GPU multigrid solvers to very ill-conditioned systems arising from the discretization on anisotropic meshes, that previously had to be solved on the CPU. The resulting mixed-precision schemes are always faster than double precision alone, and outperform tuned CPU solvers consistently by almost an order of magnitude.
  • Keywords
    computer graphic equipment; coprocessors; elliptic equations; finite element analysis; iterative methods; partial differential equations; GPU; NVIDIA CUDA; anisotropic mesh; cyclic reduction tridiagonal solver; finite element discretization; mixed precision iterative refinement; multigrid system; partial differential equation; sparse linear equation system; Anisotropic magnetoresistance; Computer architecture; Differential equations; Finite element methods; Graphics; Hardware; Iterative methods; Jacobian matrices; Partial differential equations; Performance gain; GPU Computing; NVIDIA CUDA.; cyclic reduction; finite elements; mixed-precision iterative refinement; multigrid; tridiagonal solvers;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/TPDS.2010.61
  • Filename
    5445081