DocumentCode :
2798763
Title :
Transformations to Parallel Codes for Communication-Computation Overlap
Author :
Danalis, Anthony ; Kim, Ki-Yong ; Pollock, Lori ; Swany, Martin
Author_Institution :
University of Delaware
fYear :
2005
fDate :
12-18 Nov. 2005
Firstpage :
58
Lastpage :
58
Abstract :
This paper presents program transformations directed toward improving communication-computation overlap in parallel programs that use MPI’s collective operations. Our transformations target a wide variety of applications focusing on scientific codes with computation loops that exhibit limited dependence among iterations. We include guidance for developers for transforming an application code in order to exploit the communicationcomputation overlap available in the underlying cluster, as well as a discussion of the performance improvements achieved by our transformations. We present results from a detailed study of the effect of the problem and message size, level of communication-computation overlap, and amount of communication aggregation on runtime performance in a cluster environment based on an RDMA-enabled network. The targets of our study are two scientific codes written by domain scientists, but the applicability of our work extends far beyond the scope of these two applications.
Keywords :
Application software; Bandwidth; Concurrent computing; Delay; Parallel processing; Permission; Power engineering and energy; Runtime environment; Tiles; Workstations;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Supercomputing, 2005. Proceedings of the ACM/IEEE SC 2005 Conference
Print_ISBN :
1-59593-061-2
Type :
conf
DOI :
10.1109/SC.2005.75
Filename :
1560010
Link To Document :
بازگشت