• DocumentCode
    3648615
  • Title

    Automatically Optimized GPU Acceleration of Element Subroutines in Finite Element Method

  • Author

    Jirí Filipovic;Jan Fousek; Lakomý;Matú Madzin

  • fYear
    2012
  • fDate
    7/1/2012 12:00:00 AM
  • Firstpage
    141
  • Lastpage
    144
  • Abstract
    The element subroutines in finite element method (FEM) provides enough parallelism to be successfully accelerated by contemporary GPUs. However, their efficient implementation is not straightforward and requires time-consuming exploration of numerous implementation variants. In this paper, we present kernel fusion as an optimization technique and its application for element subroutines. Moreover, we show how the kernel fusion is automated using our source-to-source compiler. We demonstrate the optimization of the element subroutines for FEM model using St. Venant-Kirchhoff material. The performance of code generated by our compiler outperforms our previously published hand-tuned implementation by factor of 1.32 - 1.54 depending on used GPU architecture. Although the optimization techniques are demonstrated on element subroutines for using St. Venant-Kirchhoff material, they are generally usable for wider area of computationally-demanding problems.
  • Keywords
    "Finite element methods","Kernel","Algorithms","Graphics processing unit","Registers","Assembly","Acceleration"
  • Publisher
    ieee
  • Conference_Titel
    Application Accelerators in High Performance Computing (SAAHPC), 2012 Symposium on
  • ISSN
    2166-5133
  • Print_ISBN
    978-1-4673-2882-1
  • Electronic_ISBN
    2166-515X
  • Type

    conf

  • DOI
    10.1109/SAAHPC.2012.23
  • Filename
    6319203