DocumentCode
1687244
Title
Automatic construction of coordinated performance skeletons
Author
Subhlok, Jaspal ; Xu, Qiang
Author_Institution
Dept. of Comput. Sci., Univ. of Houston, Houston, TX
fYear
2008
Firstpage
1
Lastpage
5
Abstract
Performance prediction is particularly challenging for dynamic and unpredictable environments that cannot be modeled well, such as execution with sharing of CPU and bandwidth resources. Our approach to performance estimation in such scenarios is based on actual execution of short-running, customized performance skeletons for target applications. This work focuses on automatic construction of performance skeletons for parallel MPI programs. Logicalization of a family of traces to a single trace is presented as a key technique for skeleton construction. Compression of communication traces is achieved by identifying the loop structure from traces. Results demonstrate that logicalization and compression are accurate and efficient. Automatically constructed performance skeletons were able to effectively predict application performance in a variety of scenarios involving resource sharing and changes in the execution environment.
Keywords
message passing; parallel programming; program control structures; coordinated performance skeleton; loop structure; parallel MPI program; resource sharing; skeleton construction; Aggregates; Bandwidth; Computer science; Libraries; Message passing; Pattern analysis; Predictive models; Resource management; Skeleton; Topology
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on
Conference_Location
Miami, FL
ISSN
1530-2075
Print_ISBN
978-1-4244-1693-6
Electronic_ISBN
1530-2075
Type
conf
DOI
10.1109/IPDPS.2008.4536405
Filename
4536405