Title :
Performance analysis and optimization of Gyrokinetic Torodial Code on TH-1A supercomputer
Author :
Zhu, Xiaoqian ; Liu, Xin ; Meng, Xiangfei ; Feng, Jinghua
Author_Institution :
Sch. of Comput., Nat. Univ. of Defense Technol., Changsha, China
Abstract :
In this study, we test and analyze the performance of Gyrokinetic Torodial Code(GTC) program. According to the analysis results, we port GTC\´s compute-intensive subroutines to GPU and speed up them on the "CPU+GPU" heterogeneous architecture of TH-1A supercomputer. Some optimization strategies are developed in this process, for example, subroutines are integrated to reduce the data transfer between host and device, GPU memory access is optimized to reduce the access latency and static keyword is designed before arrays\´ declaration to avoid unnecessary address allocation and data copy. Experiment results show that the performance of the subroutines ported to GPU is improved evidently, which is between 6 and 8 times, and the total performance of GTC could be improved by 3 to 4 times.
Keywords :
computer graphic equipment; coprocessors; parallel machines; performance evaluation; GPU; GPU memory access; GTC; Gyrokinetic Torodial Code optimisation; TH-1A supercomputer; array declaration; data transfer; optimization strategies; performance analysis; Algorithms; Arrays; Graphics processing unit; Multicore processing; Optimization; Supercomputers; GPU; GTC; TH-1A; high performance computing; nuclear fusion;
Conference_Titel :
Electrical and Control Engineering (ICECE), 2011 International Conference on
Conference_Location :
Yichang
Print_ISBN :
978-1-4244-8162-0
DOI :
10.1109/ICECENG.2011.6057053