مرکز منطقه ای اطلاع رساني علوم و فناوري - GPU Implementation of Finite Difference Solvers

DocumentCode :

233678

Title :

GPU Implementation of Finite Difference Solvers

Author :

Giles, Mike ; Laszlo, Endre ; Reguly, Istvan ; Appleyard, Jeremy ; Demouth, Julien

fYear :

2014

fDate :

16-16 Nov. 2014

Firstpage :

Lastpage :

Abstract :

This paper discusses the implementation of one-factor and three-factor PDE models on GPUs. Both explicit and implicit time-marching methods are considered, with the latter requiring the solution of multiple tridiagonal systems of equations.Because of the small amount of data involved, one-factor models are primarily compute-limited, with a very good fraction of the peak compute capability being achieved. The key to the performance lies in the heavy use of registers and shuffle instructions for the explicit method, and a non-standard hybrid Thomas/PCR algorithm for solving the tridiagonal systems for the implicit solverThe three-factor problems involve much more data, and hence their execution is more evenly balanced between computation and data communication to/from the main graphics memory. However, it is again possible to achieve a good fraction of the theoretical peak performance on both measures. The high performance requires particularly careful attention to coalescence in the data transfers, using local shared memory for small array transpositions, and padding to avoid shared memory bank conicts.Computational results include comparisons to computations on Sandy Bridge and Haswell Intel Xeon processors, using both multithreading and AVX vectorisation.

Keywords :

finite difference methods; graphics processing units; mathematics computing; partial differential equations; AVX vectorisation; GPU implementation; Haswell Intel Xeon processor; Sandy Bridge processor; finite difference solvers; graphics processing unit; hybrid Thomas-PCR algorithm; implicit time-marching methods; multithreading; one-factor PDE model; partial differential equations; registers; shuffle instructions; three-factor PDE model; tridiagonal system; Approximation algorithms; Approximation methods; Bandwidth; Equations; Graphics processing units; Instruction sets; Mathematical model; Computational finance; GPU computing; vectorisation; tridiagonal equations;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

High Performance Computational Finance (WHPCF), 2014 Seventh Workshop on

Conference_Location :

New Orleans, LA

Type :

conf

DOI :

10.1109/WHPCF.2014.10

Filename :

7016367

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=233678