• DocumentCode
    3543561
  • Title

    phiGEMM: A CPU-GPU Library for Porting Quantum ESPRESSO on Hybrid Systems

  • Author

    Spiga, Filippo ; Girotto, Ivan

  • Author_Institution
    Irish Centre for High-End Comput. (ICHEC), Dublin, Ireland
  • fYear
    2012
  • fDate
    15-17 Feb. 2012
  • Firstpage
    368
  • Lastpage
    375
  • Abstract
    GPU computing has revolutionized HPC by bringing the performance of the supercomputer to the desktop. Attractive price, performance, and power characteristics allow multiple GPUs to be plugged into both desktop machines as well as supercomputer nodes for increased performance. Excellent performance and scalability can be achieved for some problems using hybrid combinations of multiple GPUs and CPU computing resources. This paper presents the acceleration of the open-source Quantum ESPRESSO package with the freely available phiGEMM library. Specifically, the parallel implementation and scaling of the phiGEMM matrix-matrix multiplication will be discussed. This library can be called from applications through all standard GEMM interfaces and it is able to perform matrix-matrix multiplications using one or more GPUs as well as the host multi-core processor. An 8.9-times speedup is reported in overall run-time of a representative AUSURF112 benchmark for a PWscf calculation. In addition, multi-GPU scaling and performance for 3D-FFTs are discussed.
  • Keywords
    fast Fourier transforms; graphics processing units; matrix decomposition; multiprocessing systems; parallel machines; 3D-FFT; AUSURF112 benchmark; CPU-GPU library; GEMM interface; GPU computing; GPU-CPU computing resources; HPC; PWscf calculation; desktop machine; hybrid system; multiGPU scaling; multicore processor; open-source Quantum ESPRESSO package; parallel implementation; phiGEMM library; phiGEMM matrix-matrix multiplication; supercomputer node; Benchmark testing; Graphics processing unit; Kernel; Libraries; Performance evaluation; Supercomputers; 3D-FFT; CUDA; Density Functional Theory; application porting; matrix-matrix multiplication; multi-GPU; phiGEMM;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel, Distributed and Network-Based Processing (PDP), 2012 20th Euromicro International Conference on
  • Conference_Location
    Garching
  • ISSN
    1066-6192
  • Print_ISBN
    978-1-4673-0226-5
  • Type

    conf

  • DOI
    10.1109/PDP.2012.72
  • Filename
    6169574