Title :
GPU-Based Implementation of Finite Element Method for Elasticity Using CUDA
Author :
Jianfei Zhang ; Defei Shen
Author_Institution :
Coll. of Mech. & Mater., Hohai Univ., Nanjing, China
Abstract :
Graphics Processing Unit (GPU) has obtained great success in scientific computations for its tremendous computational horsepower and very high memory bandwidth. This paper discusses the way to accelerate the finite element method (FEM) for elasticity problem on NVIDIA GPUs using Compute Unified Device Architecture (CUDA), mainly including formation and solution of finite element equations. Multiple strategies for efficiently accessing global memory are introduced to achieve memory coalescing. Shared memory is utilized to reuse data for enhancing memory bandwidth efficiency. The numerical results show that GPU-accelerated finite element computation with using these optimizing strategies can achieve a speedup of around 10-20x to serial version for different element types.
Keywords :
finite element analysis; graphics processing units; parallel architectures; CUDA; FEM; GPU-accelerated finite element computation; NVIDIA GPU; computational horsepower; compute unified device architecture; elasticity problem; finite element equations; finite element method; global memory; graphics processing unit; memory bandwidth; memory coalescing; scientific computations; shared memory; Equations; Finite element analysis; Graphics processing units; Instruction sets; Kernel; Mathematical model; Vectors; CUDA; Conjugate gradient method; Elasticity; Finite element method;
Conference_Titel :
High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
Conference_Location :
Zhangjiajie
DOI :
10.1109/HPCC.and.EUC.2013.142