  • DocumentCode
    167532
  • Title
    Nuclear Fusion Simulation Code Optimization and Performance Evaluation on GPU Cluster
  • Author
    Fujita, Norihisa ; Nuga, Hideo ; Boku, Taisuke ; Idomura, Yasuhiro
  • Author_Institution
    Grad. Sch. of Syst. & Inf. Eng., Univ. of Tsukuba, Tsukuba, Japan
  • fYear
    2014
  • fDate
    19-23 May 2014
  • Firstpage
    1266
  • Lastpage
    1274
  • Abstract
    The conservative global gyrokinetic toroidal full-f five-dimensional Vlasov simulation (GT5D) is a nuclear fusion simulation program designed to analyze turbulence phenomena in tokamak plasma. In this research, we optimize it for graphics processing unit (GPU) clusters with multiple GPUs on each node. Based on profiling results of GT5D on a CPU node, we decided to offload the entire time-development part of the program to GPUs, except for MPI communication. Our evaluation shows a maximum speedup of 3.35 times at the function level and 1.91 times in total performance over CPU-only execution, both measured on the high-density GPU cluster HA-PACS, where each computation node consists of four NVIDIA M2090 GPUs and two Intel Xeon E5-2670 (Sandy Bridge) CPUs that provide 16 cores in total. Note that these performance improvements for a single GPU were obtained in measurements against four CPU cores, not a single CPU core, and include a 63% performance gain obtained by overlapping communication between MPI processes with GPU calculations.
  • Keywords
    Tokamak devices; application program interfaces; graphics processing units; nuclear engineering computing; nuclear fusion; plasma toroidal confinement; GPU cluster; GT5D; HA-PACS; Intel Xeon E5-2670; MPI communication; NVIDIA M2090 GPU; global gyrokinetic toroidal full-f five-dimensional Vlasov simulation; graphics processing unit; nuclear fusion simulation code optimization; performance evaluation; tokamak plasma; turbulence phenomena; Computational modeling; Data transfer; Fusion reactors; Graphics processing units; Instruction sets; Kernel; Plasmas; CUDA; GPGPU; Nuclear Fusion;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel & Distributed Processing Symposium Workshops (IPDPSW), 2014 IEEE International
  • Conference_Location
    Phoenix, AZ
  • Print_ISBN
    978-1-4799-4117-9
  • Type
    conf
  • DOI
    10.1109/IPDPSW.2014.142
  • Filename
    6969524