مرکز منطقه ای اطلاع رساني علوم و فناوري - High locality and increased intra-node parallelism for solving finite element models on GPUs by novel element-by-element implementation

DocumentCode :

589955

Title :

High locality and increased intra-node parallelism for solving finite element models on GPUs by novel element-by-element implementation

Author :

Kiss, Istvan ; Badics, Zsolt ; Gyimothy, Szabolcs ; Pavo, Jozsef

Author_Institution :

Budapest Univ. of Technol. & Econ., Budapest, Hungary

fYear :

2012

fDate :

10-12 Sept. 2012

Firstpage :

Lastpage :

Abstract :

The utilization of Graphical Processing Units (GPUs) for the element-by-element (EbE) finite element method (FEM) is demonstrated. EbE FEM is a long known technique, by which a conjugate gradient (CG) type iterative solution scheme can be entirely decomposed into computations on the element level, i.e., without assembling the global system matrix. In our implementation, NVIDIA´s parallel computing solution, the Compute Unified Device Architecture (CUDA), is used to perform the required element-wise computations in parallel. Since element matrices need not be stored, the memory requirement can be kept extremely low. It is shown that this low-storage but computation-intensive technique is better suited for GPUs than those requiring the massive manipulation of large data sets. This study of the proposed parallel model illustrates a highly improved locality and minimization of data movement, which could also significantly reduce energy consumption in other heterogeneous HPC architectures.

Keywords :

conjugate gradient methods; finite element analysis; graphics processing units; parallel architectures; CG type iterative solution scheme; CUDA; Compute Unified Device Architecture; EbE; FEM; GPU; NVIDIA parallel computing solution; computation-intensive technique; conjugate gradient type iterative solution scheme; data movement minimization; element-by-element implementation; element-wise computations; energy consumption reduction; finite element models; graphical processing units; heterogeneous HPC architectures; high locality; intranode parallelism; parallel model; Computational modeling; Finite element methods; Graphics processing units; Kernel; Matrix decomposition; Sparse matrices; Vectors; CUDA Computing; EbE FEM; GPU Computing; parallel FEM;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

High Performance Extreme Computing (HPEC), 2012 IEEE Conference on

Conference_Location :

Waltham, MA

Print_ISBN :

978-1-4673-1577-7

Type :

conf

DOI :

10.1109/HPEC.2012.6408659

Filename :

6408659

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=589955