DocumentCode :
599309
Title :
An Optimization of FMM under CPU+GPU Heterogeneous Architecture
Author :
Yonghua Zhu ; Xiao Lu
Author_Institution :
Comput. Center, Shanghai Univ., Shanghai, China
fYear :
2012
fDate :
9-11 Sept. 2012
Firstpage :
147
Lastpage :
150
Abstract :
Heterogeneous architecture of CPU+GPU has been the main trend for high-performance computing/parallel processing in recent years. However, the formulation of scientific algorithms to take advantage of the performance offered by the new architecture requires rethinking core methods. The algorithmic acceleration is achieved with the main part of fast multipole method (FMM) under the heterogeneous architecture. Based on PetFMM, a Two Dimensional Threads Mapping Model (TDTMM) is proposed to lighten the workload per thread on GPU. The presented threads mapping model is able to improve the execution efficiency of hardware acceleration. Experiment results show that the presented models are feasible and effective.
Keywords :
computational complexity; graphics processing units; multi-threading; parallel processing; CPU-GPU heterogeneous architecture; FMM optimization; TDTMM model; algorithmic acceleration; central processing unit; fast multipole method; graphics processing unit; hardware acceleration; high-performance computing; parallel processing; two dimensional threads mapping model; Acceleration; Computational modeling; Computer architecture; Graphics processing units; Hardware; Instruction sets; Kernel; FMM; GPU; Heterogeneous Architecture; Threads Mapping Model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Commerce and Enterprise Computing (CEC), 2012 IEEE 14th International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4673-6246-7
Type :
conf
DOI :
10.1109/CEC.2012.33
Filename :
6470792
Link To Document :
بازگشت