DocumentCode
166643
Title
Software technologies coping with memory hierarchy of GPGPU clusters for stencil computations
Author
Endo, T. ; Guanghao Jin
Author_Institution
Global Sci. Inf. & Comput. Center, Tokyo Inst. of Technol./JST-CREST, Tokyo, Japan
fYear
2014
fDate
22-26 Sept. 2014
Firstpage
132
Lastpage
139
Abstract
Stencil computations, which are important kernels for CFD simulations, have been highly successful on GPGPU clusters, due to high memory bandwidth and computation speed of GPU accelerators. However, sizes of the computed domains are limited by small capacity of GPU device memory. In order to support larger domain sizes, we utilize the memory hierarchy of GPGPU clusters; larger host memory is used for maintain large domains. However, it is challenging to achieve all of larger domain sizes, high performance and easiness of program development. Towards this goal, we combine two software technologies. From the aspect of algorithm, we adopt a locality improvement technique called temporal blocking. From the aspect of system software, we developed a MPI/CUDA wrapper library named HHRT, which supports memory swapping and finer grained programming model. With this combination, we demonstrate that our goal is achieved through evaluations on TSUBAME2.5, a petascale GPGPU supercomputer.
Keywords
application program interfaces; computational fluid dynamics; graphics processing units; message passing; parallel architectures; CFD simulations; GPGPU clusters; GPU accelerators; GPU device memory; HHRT; MPI/CUDA wrapper library; TSUBAME2.5; computation speed; grained programming model; kernels; locality improvement technique; memory bandwidth; memory hierarchy; memory swapping; petascale GPGPU supercomputer; program development; software technologies; stencil computations; temporal blocking; Arrays; Bandwidth; Graphics processing units; Libraries; Performance evaluation; Programming; Supercomputers;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing (CLUSTER), 2014 IEEE International Conference on
Conference_Location
Madrid
Type
conf
DOI
10.1109/CLUSTER.2014.6968747
Filename
6968747
Link To Document