• DocumentCode
    1998034
  • Title

    Performance Dissection of a Molecular Dynamics Code across CUDA and GPU Generations

  • Author

    Wezowicz, M. ; Estrada, Trilce ; Patel, Surabhi ; Taufer, Michela

  • Author_Institution
    Univ. of Delaware, Newark, DE, USA
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    1355
  • Lastpage
    1364
  • Abstract
    The first release of CUDA was in 2007. Since then, we have experienced frequent new releases. CUDA reached its maximum performance with CUDA 4.0. Since its release, NVIDIA has started a re-design of the CUDA framework driven by software engineering principles, i.e., the search for a general, multi-layer framework whose compiler back-end is unified with OpenCL. At the same time, the GPU architecture has been moving from Fermi to Kepler by including faster memory speed, larger numbers of stream processors, and redesigned pipelines. The two directions have orthogonal results in terms of performance. The software generality has resulted in the slowdown of codes that were heavily optimized for older generations of CUDA, while the hardware improvements have resulted in steadily increasing performance. The performance dissection presented in this paper identifies sweet spots and trade-offs between software generality and hardware improvements for a diverse set of kernels in an open-source molecular dynamics code.
  • Keywords
    graphics processing units; molecular dynamics method; parallel architectures; pipeline processing; program compilers; public domain software; CUDA 4.0; GPU architecture; NVIDIA; OpenCL; compiler backend; multilayer framework; open source molecular dynamics code; pipeline redesign; software engineering principle; software generality; stream processor; Computational modeling; Computer architecture; Graphics processing units; Hardware; Kernel; CUDA; Fermi GPUs; Kepler GPUs; OpenCL; membrane systems;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
  • Conference_Location
    Cambridge, MA
  • Print_ISBN
    978-0-7695-4979-8
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2013.112
  • Filename
    6651028