DocumentCode :
3564316
Title :
Efficient utilization of memory hierarchy to enable the computation on bigger domains for stencil computation in CPU-GPU based systems
Author :
Guanghao Jin ; Lin, James ; Endo, Toshio
Author_Institution :
JST-CREST, Tokyo Inst. of Technol., Tokyo, Japan
fYear :
2014
Firstpage :
1
Lastpage :
6
Abstract :
The problem size of the stencil computation is limited by the memory capacity GPU, which is typically smaller than that of CPU memory in CPU-GPU based super computers. To enable bigger domain computation while maintaining high performance, this paper proposes and evaluates optimization methods for stencil computation by efficient utilization of the memory hierarchy in those systems. It uses temporal blocking method to enable bigger domain computation on GPU. Then, it uses buffer-copy method to solve redundancy problem of temporal blocking method. To solve the space consumption problem of buffer-copy method on GPU side, it sets buffer on the CPU side. Evaluation of stencil simulation on 3D domain shows that our new method for 7-point and 13-point stencils achieves good performance which is 1.22 times and 1.28 times higher than other methods on average.
Keywords :
buffer storage; graphics processing units; optimisation; parallel processing; redundancy; 13-point stencils; 3D domain; 7-point stencils; CPU side; CPU-CPU based super computers; CPU-GPU based systems; bigger domain computation; buffer-copy method; memory capacity CPU; memory hierarchy utilization; optimization methods; redundancy problem; space consumption problem; stencil computation; temporal blocking method; Abstracts; Optimization; CPU-GPU based super computers; memory capacity; memory hierarchy; optimization methods; stencil computation; temporal blocking;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing and Applications (ICHPCA), 2014 International Conference on
Print_ISBN :
978-1-4799-5957-0
Type :
conf
DOI :
10.1109/ICHPCA.2014.7045354
Filename :
7045354
Link To Document :
بازگشت