DocumentCode
3502045
Title
A Performance Instrumentation Framework to Characterize Computation-Communication Overlap in Message-Passing Systems
Author
Shet, Aniruddha G. ; Sadayappan, P. ; Bernholdt, David E. ; Nieplocha, Jarek ; Tipparaju, Vinod
Author_Institution
Ohio State Univ., Columbus, OH
fYear
2006
fDate
25-28 Sept. 2006
Firstpage
1
Lastpage
12
Abstract
Effective overlap of computation and communication is a well understood technique for latency hiding and can yield significant performance gains for applications on high-end computers. In this paper, we propose an instrumentation framework for message-passing systems to characterize the degree of overlap of communication with computation in the execution of parallel applications. The inability to obtain precise time-stamps for pertinent communication events is a significant problem, and is addressed by generation of minimum and maximum bounds on achieved overlap. The overlap measures can aid application developers and system designers in investigating scalability issues. The approach has been used to instrument two MPI implementations as well as the ARMCI system. The implementation resides entirely within the communication library and thus integrates well with existing approaches that operate outside the library. The usefulness of the framework is shown by analyzing available overlap for microbenchmarks and NAS benchmarks, and the insights obtained are used to improve achieved overlap by modifying the NAS SP benchmark
Keywords
message passing; parallel processing; ARMCI system; MPI implementations; NAS SP benchmark; NAS benchmarks; communication library; computation-communication overlap; high-end computers; latency hiding; message-passing systems; microbenchmarks; parallel applications; performance instrumentation framework; pertinent communication events; system designers; Application software; Computer architecture; Concurrent computing; Delay; Engines; High performance computing; Instruments; Laboratories; Performance gain; Software libraries;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing, 2006 IEEE International Conference on
Conference_Location
Barcelona
ISSN
1552-5244
Print_ISBN
1-4244-0327-8
Electronic_ISBN
1552-5244
Type
conf
DOI
10.1109/CLUSTR.2006.311887
Filename
4100393
Link To Document