• DocumentCode
    3205863
  • Title

    PATUS: A Code Generation and Autotuning Framework for Parallel Iterative Stencil Computations on Modern Microarchitectures

  • Author

    Christen, Matthias ; Schenk, Olaf ; Burkhart, Helmar

  • Author_Institution
    Dept. of Math. & Comput. Sci., Univ. of Basel, Basel, Switzerland
  • fYear
    2011
  • fDate
    16-20 May 2011
  • Firstpage
    676
  • Lastpage
    687
  • Abstract
    Stencil calculations comprise an important class of kernels in many scientific computing applications ranging from simple PDE solvers to constituent kernels in multigrid methods as well as image processing applications. In such types of solvers, stencil kernels are often the dominant part of the computation, and an efficient parallel implementation of the kernel is therefore crucial in order to reduce the time to solution. However, in the current complex hardware micro architectures, meticulous architecture-specific tuning is required to elicit the machine´s full compute power. We present a code generation and auto-tuning framework textsc{Patus} for stencil computations targeted at multi- and many core processors, such as multicore CPUs and graphics processing units, which makes it possible to generate compute kernels from a specification of the stencil operation and a parallelization and optimization strategy, and leverages the auto tuning methodology to optimize strategy-dependent parameters for the given hardware architecture.
  • Keywords
    computer architecture; computer graphic equipment; coprocessors; multiprocessing systems; parallel processing; partial differential equations; PATUS; PDE solvers; autotuning framework; code generation; complex hardware microarchitectures; graphics processing units; image processing; many core processors; meticulous architecture-specific tuning; multicore CPU; multicore processors; multigrid methods; parallel iterative stencil computations; stencil kernels; strategy-dependent parameters; Arrays; Hardware; Instruction sets; Kernel; Parallel processing; Three dimensional displays;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing Symposium (IPDPS), 2011 IEEE International
  • Conference_Location
    Anchorage, AK
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-61284-372-8
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2011.70
  • Filename
    6012879