DocumentCode :
56036
Title :
GPU accelerated finite-element computation for electromagnetic analysis
Author :
Huan-Ting Meng ; Bao-Lin Nie ; Wong, Simon ; Macon, Charles ; Jian-Ming Jin
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Volume :
56
Issue :
2
fYear :
2014
fDate :
Apr-14
Firstpage :
39
Lastpage :
62
Abstract :
General-purpose computing on graphics processing units (GPGPU), with programming models such as the Compute Unified Device Architecture (CUDA) by NVIDIA, offers the capability for accelerating the solution process of computational electromagnetics analysis. However, due to the communication-intensive nature of the finite-element algorithm, both the assembly and the solution phases cannot be implemented via fine-grained many-core GPU processors in a straightforward manner. In this paper, we identify the bottlenecks in the GPU parallelization of the Finite-Element Method for electromagnetic analysis, and propose potential solutions to alleviate the bottlenecks. We first discuss efficient parallelization strategies for the finite-element matrix assembly on a single GPU and on multiple GPUs. We then explore parallelization strategies for the finite-element matrix solution, in conjunction with parallelizable preconditioners to reduce the total solution time. We show that with a proper parallelization and implementation, GPUs are able to achieve significant speedups over OpenMP-enabled multi-core CPUs.
Keywords :
computational electromagnetics; finite element analysis; graphics processing units; CUDA; GPGPU; GPU accelerated finite element computation; GPU parallelization; NVIDIA; OpenMP-enabled multicore CPU; communication intensive nature; computational electromagnetics analysis; compute unified device architecture; fine-grained many-core GPU processors; finite element algorithm; finite element matrix assembly; general purpose computing; graphics processing units; parallelizable preconditioners; programming models; Computational electromagnetics; Computer architecture; Finite element analysis; Frequency-domain analysis; Graphics processing units; High performance computing; Instruction sets; Computational electromagnetics; finite element analysis; frequency-domain analysis; graphics processing units; high performance computing; parallel programming;
fLanguage :
English
Journal_Title :
Antennas and Propagation Magazine, IEEE
Publisher :
ieee
ISSN :
1045-9243
Type :
jour
DOI :
10.1109/MAP.2014.6837065
Filename :
6837065
Link To Document :
بازگشت