DocumentCode :
3223594
Title :
190 TFlops Astrophysical N-body Simulation on a Cluster of GPUs
Author :
Hamada, Tsuyoshi ; Nitadori, Keigo
Author_Institution :
Nagasaki Adv. Comput. Center, Nagasaki Univ., Nagasaki, Japan
fYear :
2010
fDate :
13-19 Nov. 2010
Firstpage :
1
Lastpage :
9
Abstract :
We present the results of a hierarchical N-body simulation on DEGIMA, a cluster of PCs with 576 graphic processing units (GPUs) and using an InfiniBand interconnect. DEGIMA stands for DEstination for GPU Intensive MAchine, and is located at Nagasaki Advanced Computing Center (NACC), Nagasaki University. In this work, we have upgraded DEGIMA_s interconnect using InfiniBand. DEGIMA is composed by 144 nodes with 576 GT200 GPUs. An astrophysical N-body simulation with 3,278,982,596 particles using a treecode algorithm shows a sustained performance of 190.5 Tflops on DEGIMA. The overall cost of the hardware was $411,921 dollars. The maximum corrected performance is 104.8 Tflops for the simulation, resulting in a cost performance of 254.4 MFlops/$. This corrections is performed by counting the FLOPS based on the most efficient CPU algorithm. Any extra FLOPS that arise from the GPU implementation and parameter differences are not included in the 254.4 MFLOPS/$.
Keywords :
N-body simulations (astronomical); gravitation; DEGIMA; DEstination for GPU Intensive MAchine; GT200 GPUs; InfiniBand interconnect; Nagasaki Advanced Computing Center; astrophysical N-body simulation; graphic processing units; gravitation; treecode algorithm; Computational modeling; Force; Graphics processing unit; Instruction sets; Kernel; Pipelines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis (SC), 2010 International Conference for
Conference_Location :
New Orleans, LA
Print_ISBN :
978-1-4244-7557-5
Electronic_ISBN :
978-1-4244-7558-2
Type :
conf
DOI :
10.1109/SC.2010.1
Filename :
5644906
Link To Document :
بازگشت