DocumentCode :
2958529
Title :
Automatic Resource Scheduling with Latency Hiding for Parallel Stencil Applications on GPGPU Clusters
Author :
Maeda, Kumiko ; Murase, Masana ; Doi, Munehiro ; Komatsu, Hideaki ; Noda, Shigeho ; Himeno, Ryutaro
Author_Institution :
IBM Res. - Tokyo, IBM Japan, Ltd., Tokyo, Japan
fYear :
2012
fDate :
21-25 May 2012
Firstpage :
544
Lastpage :
556
Abstract :
Overlapping computations and communication is a key to accelerating stencil applications on parallel computers, especially for GPU clusters. However, such programming is a time-consuming part of the stencil application development. To address this problem, we developed an automatic code generation tool to produce a parallel stencil application with latency hiding automatically from its dataflow model. With this tool, users visually construct the workflows of stencil applications in a dataflow programming model. Our dataflow compiler determines a data decomposition policy for each application, and generates source code that overlaps the stencil computations and communication (MPI and PCIe). We demonstrate two types of overlapping models, a CPU-GPU hybrid execution model and a GPU-only model. We use a CFD benchmark computing 19-point 3D stencils to evaluate our scheduling performance, which results in 1.45 TFLOPS in single precision on a cluster with 64 Tesla C1060 GPUs.
Keywords :
graphics processing units; parallel processing; partial differential equations; processor scheduling; search problems; GPGPU Clusters; PDE; automatic code generation tool; automatic resource scheduling; data decomposition; dataflow compiler; dataflow programming model; latency hiding; parallel computers; parallel stencil applications; partial differential equations; source code generation; Computational modeling; Graphics processing unit; Hardware; Jacobian matrices; Kernel; Peer to peer computing; Rivers; latency hiding; network embedding; resource scheduling; stencil computations;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel & Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International
Conference_Location :
Shanghai
ISSN :
1530-2075
Print_ISBN :
978-1-4673-0975-2
Type :
conf
DOI :
10.1109/IPDPS.2012.57
Filename :
6267857
Link To Document :
بازگشت