Title :
Automatic construction of coordinated performance skeletons
Author :
Subhlok, Jaspal ; Xu, Qiang
Author_Institution :
Dept. of Comput. Sci., Univ. of Houston, Houston, TX
Abstract :
Performance prediction is particularly challenging for dynamic and unpredictable environments that cannot be modeled well, such as execution with shared CPU and bandwidth resources. Our approach to performance estimation in such scenarios is based on the actual execution of short-running, customized performance skeletons for target applications. This work focuses on the automatic construction of performance skeletons for parallel MPI programs. Logicalization of a family of traces into a single trace is presented as a key technique for skeleton construction. Communication traces are compressed by identifying their loop structure. Results demonstrate that logicalization and compression are accurate and efficient. Automatically constructed performance skeletons effectively predicted application performance in a variety of scenarios involving resource sharing and changes in the execution environment.
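The abstract's idea of compressing a communication trace by recovering its loop structure can be illustrated with a small sketch. This is not the authors' algorithm, which the abstract does not specify; it is a simple greedy fold of adjacent repeated blocks, written under the assumption that a trace is a flat list of event names given as strings. The function names `compress` and `expand` and the sample event names are hypothetical.

```python
def compress(trace):
    """Fold adjacent repeats of a block into (count, body) loop nodes.

    Assumption: events are plain strings, so tuples can mark loops.
    Greedy choice: at each position, take the repeat covering the most events.
    """
    out = []
    i, n = 0, len(trace)
    while i < n:
        best_len, best_count = 1, 1
        # A block can only repeat if it fits at least twice in the remainder.
        for length in range(1, (n - i) // 2 + 1):
            block = trace[i:i + length]
            count = 1
            while trace[i + count * length:i + (count + 1) * length] == block:
                count += 1
            if count > 1 and count * length > best_count * best_len:
                best_len, best_count = length, count
        if best_count > 1:
            # Recurse so that loops nested inside the body are folded too.
            out.append((best_count, compress(trace[i:i + best_len])))
        else:
            out.append(trace[i])
        i += best_len * best_count
    return out


def expand(nodes):
    """Inverse of compress: unroll loop nodes back into a flat trace."""
    out = []
    for node in nodes:
        if isinstance(node, tuple):
            count, body = node
            out.extend(expand(body) * count)
        else:
            out.append(node)
    return out


if __name__ == "__main__":
    trace = ["send", "recv"] * 3 + ["barrier"]
    print(compress(trace))
```

In this sketch, a trace of three send/recv pairs followed by a barrier folds into one loop node plus the barrier, and `expand` restores the original sequence. A real trace would also carry message sizes, peers, and timing, which this toy representation omits.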
Keywords :
message passing; parallel programming; program control structures; coordinated performance skeleton; loop structure; parallel MPI program; resource sharing; skeleton construction; Aggregates; Bandwidth; Computer science; Libraries; Message passing; Pattern analysis; Predictive models; Resource management; Skeleton; Topology;
Conference_Title :
Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4244-1693-6
Electronic_ISBN :
1530-2075
DOI :
10.1109/IPDPS.2008.4536405