DocumentCode :
589955
Title :
High locality and increased intra-node parallelism for solving finite element models on GPUs by novel element-by-element implementation
Author :
Kiss, Istvan ; Badics, Zsolt ; Gyimothy, Szabolcs ; Pavo, Jozsef
Author_Institution :
Budapest Univ. of Technol. & Econ., Budapest, Hungary
fYear :
2012
fDate :
10-12 Sept. 2012
Firstpage :
1
Lastpage :
5
Abstract :
The utilization of Graphical Processing Units (GPUs) for the element-by-element (EbE) finite element method (FEM) is demonstrated. EbE FEM is a long known technique, by which a conjugate gradient (CG) type iterative solution scheme can be entirely decomposed into computations on the element level, i.e., without assembling the global system matrix. In our implementation, NVIDIA´s parallel computing solution, the Compute Unified Device Architecture (CUDA), is used to perform the required element-wise computations in parallel. Since element matrices need not be stored, the memory requirement can be kept extremely low. It is shown that this low-storage but computation-intensive technique is better suited for GPUs than those requiring the massive manipulation of large data sets. This study of the proposed parallel model illustrates a highly improved locality and minimization of data movement, which could also significantly reduce energy consumption in other heterogeneous HPC architectures.
Keywords :
conjugate gradient methods; finite element analysis; graphics processing units; parallel architectures; CG type iterative solution scheme; CUDA; Compute Unified Device Architecture; EbE; FEM; GPU; NVIDIA parallel computing solution; computation-intensive technique; conjugate gradient type iterative solution scheme; data movement minimization; element-by-element implementation; element-wise computations; energy consumption reduction; finite element models; graphical processing units; heterogeneous HPC architectures; high locality; intranode parallelism; parallel model; Computational modeling; Finite element methods; Graphics processing units; Kernel; Matrix decomposition; Sparse matrices; Vectors; CUDA Computing; EbE FEM; GPU Computing; parallel FEM;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Extreme Computing (HPEC), 2012 IEEE Conference on
Conference_Location :
Waltham, MA
Print_ISBN :
978-1-4673-1577-7
Type :
conf
DOI :
10.1109/HPEC.2012.6408659
Filename :
6408659
Link To Document :
بازگشت