Title :
An Optimization of FMM under CPU+GPU Heterogeneous Architecture
Author :
Yonghua Zhu ; Xiao Lu
Author_Institution :
Comput. Center, Shanghai Univ., Shanghai, China
Abstract :
Heterogeneous architecture of CPU+GPU has been the main trend for high-performance computing/parallel processing in recent years. However, the formulation of scientific algorithms to take advantage of the performance offered by the new architecture requires rethinking core methods. The algorithmic acceleration is achieved with the main part of fast multipole method (FMM) under the heterogeneous architecture. Based on PetFMM, a Two Dimensional Threads Mapping Model (TDTMM) is proposed to lighten the workload per thread on GPU. The presented threads mapping model is able to improve the execution efficiency of hardware acceleration. Experiment results show that the presented models are feasible and effective.
Keywords :
computational complexity; graphics processing units; multi-threading; parallel processing; CPU-GPU heterogeneous architecture; FMM optimization; TDTMM model; algorithmic acceleration; central processing unit; fast multipole method; graphics processing unit; hardware acceleration; high-performance computing; parallel processing; two dimensional threads mapping model; Acceleration; Computational modeling; Computer architecture; Graphics processing units; Hardware; Instruction sets; Kernel; FMM; GPU; Heterogeneous Architecture; Threads Mapping Model;
Conference_Titel :
Commerce and Enterprise Computing (CEC), 2012 IEEE 14th International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4673-6246-7
DOI :
10.1109/CEC.2012.33