DocumentCode :
598584
Title :
Bamboo -- Translating MPI applications to a latency-tolerant, data-driven form
Author :
Tan Nguyen ; Cicotti, Pietro ; Bylaska, E. ; Quinlan, Daniel ; Baden, Scott B.
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of California, San Diego, La Jolla, CA, USA
fYear :
2012
fDate :
10-16 Nov. 2012
Firstpage :
1
Lastpage :
11
Abstract :
We present Bamboo, a custom source-to-source translator that transforms MPI C source into a data-driven form that automatically overlaps communication with available computation. Running on up to 98304 processors of NERSC´s Hopper system, we observe that Bamboo´s overlap capability speeds up MPI implementations of a 3D Jacobi iterative solver and Cannon´s matrix multiplication. Bamboo´s generated code meets or exceeds the performance of hand optimized MPI, which includes split-phase coding, the method classically employed to hide communication. We achieved our results with only modest amounts of programmer annotation and no intrusive reprogramming of the original application source.
Keywords :
C language; Jacobian matrices; application program interfaces; iterative methods; language translation; matrix multiplication; message passing; multiprocessing systems; 3D Jacobi iterative solver; 98304 processor; Bamboo generated code; C source transform; Cannon matrix multiplication; MPI application translation; NERSC Hopper system; custom source-to-source translator; data driven form; latency tolerant; split phase coding; Geometry; Jacobian matrices; Layout; Libraries; Optimization; Program processors; Runtime;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis (SC), 2012 International Conference for
Conference_Location :
Salt Lake City, UT
ISSN :
2167-4329
Print_ISBN :
978-1-4673-0805-2
Type :
conf
DOI :
10.1109/SC.2012.23
Filename :
6468471
Link To Document :
بازگشت