DocumentCode
2958529
Title
Automatic Resource Scheduling with Latency Hiding for Parallel Stencil Applications on GPGPU Clusters
Author
Maeda, Kumiko ; Murase, Masana ; Doi, Munehiro ; Komatsu, Hideaki ; Noda, Shigeho ; Himeno, Ryutaro
Author_Institution
IBM Res. - Tokyo, IBM Japan, Ltd., Tokyo, Japan
fYear
2012
fDate
21-25 May 2012
Firstpage
544
Lastpage
556
Abstract
Overlapping computations and communication is a key to accelerating stencil applications on parallel computers, especially for GPU clusters. However, such programming is a time-consuming part of the stencil application development. To address this problem, we developed an automatic code generation tool to produce a parallel stencil application with latency hiding automatically from its dataflow model. With this tool, users visually construct the workflows of stencil applications in a dataflow programming model. Our dataflow compiler determines a data decomposition policy for each application, and generates source code that overlaps the stencil computations and communication (MPI and PCIe). We demonstrate two types of overlapping models, a CPU-GPU hybrid execution model and a GPU-only model. We use a CFD benchmark computing 19-point 3D stencils to evaluate our scheduling performance, which results in 1.45 TFLOPS in single precision on a cluster with 64 Tesla C1060 GPUs.
Keywords
graphics processing units; parallel processing; partial differential equations; processor scheduling; search problems; GPGPU Clusters; PDE; automatic code generation tool; automatic resource scheduling; data decomposition; dataflow compiler; dataflow programming model; latency hiding; parallel computers; parallel stencil applications; partial differential equations; source code generation; Computational modeling; Graphics processing unit; Hardware; Jacobian matrices; Kernel; Peer to peer computing; Rivers; latency hiding; network embedding; resource scheduling; stencil computations;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel & Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International
Conference_Location
Shanghai
ISSN
1530-2075
Print_ISBN
978-1-4673-0975-2
Type
conf
DOI
10.1109/IPDPS.2012.57
Filename
6267857
Link To Document