• DocumentCode
    580113
  • Title

    Multi-core acceleration of chemical kinetics for simulation and prediction

  • Author

    Linford, J.C. ; Michalakes, J. ; Vachharajani, M. ; Sandu, Andreea

  • Author_Institution
    Virginia Polytech. Inst. & State Univ., Blacksburg, VA, USA
  • fYear
    2009
  • fDate
    14-20 Nov. 2009
  • Firstpage
    1
  • Lastpage
    11
  • Abstract
    This work implements a computationally expensive chemical kinetics kernel from a large-scale community atmospheric model on three multi-core platforms: NVIDIA GPUs using CUDA, the Cell Broadband Engine, and Intel Quad-Core Xeon CPUs. A comparative performance analysis for each platform in double and single precision on coarse and fine grids is presented. Platform-specific design and optimization is discussed in a mechanism-agnostic way, permitting the optimization of many chemical mechanisms. The implementation of a three-stage Rosenbrock solver for SIMD architectures is discussed. When used as a template mechanism in the the Kinetic PreProcessor, the multi-core implementation enables the automatic optimization and porting of many chemical mechanisms on a variety of multi-core platforms. Speedups of 5.5x in single precision and 2.7x in double precision are observed when compared to eight Xeon cores. Compared to the serial implementation, the maximum observed speedup is 41.1x in single precision.
  • Keywords
    multiprocessing systems; optimisation; parallel architectures; reaction kinetics; CUDA; Cell Broadband Engine; Intel Quad-Core Xeon CPU; Kinetic PreProcessor; NVIDIA GPU; SIMD architecture; Xeon core; automatic optimization; chemical kinetics kernel; chemical mechanism; coarse grid; fine grid; large-scale community atmospheric model; multicore acceleration; multicore implementation; multicore platform; platform-specific design; template mechanism; three-stage Rosenbrock solver; NVIDIA CUDA; atmospheric modeling; cell broadband engine; chemical kinetics; kinetic preprocessor; multi-core; open-MP;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing Networking, Storage and Analysis, Proceedings of the Conference on
  • Conference_Location
    Portland, OR
  • Type

    conf

  • DOI
    10.1145/1654059.1654067
  • Filename
    6375562