DocumentCode :
1857635
Title :
Unveiling Internal Evolution of Parallel Application Computation Phases
Author :
Servat, Harald ; Llort, Germán ; Giménez, Judit ; Huck, Kevin ; Labarta, Jesús
Author_Institution :
Barcelona Supercomput. Center, Univ. Politec. de Catalunya, Barcelona, Spain
fYear :
2011
fDate :
13-16 Sept. 2011
Firstpage :
155
Lastpage :
164
Abstract :
As access to supercomputing resources is becoming more and more commonplace, performance analysis tools are gaining importance in order to decrease the gap between the application performance and the supercomputers´ peak performance. Performance analysis tools allow the analyst to understand the idiosyncrasies of an application in order to improve it. However, these tools require monitoring regions of the application to provide information to the analysts, leaving non-monitored regions of code unknown, which may result in lack of understanding of important regions of the application. In this paper we describe an automated methodology that reports very detailed application insights and improves the analysis experience of performance tools based on traces. We apply this methodology to three production applications and provide suggestions on how to improve their performance. Our methodology uses computation burst clustering and a mechanism called folding. While clustering automatically detects application structure, folding combines instrumentation and sampling to augment the performance analysis details. Folding provides fine grain performance information from coarse grain sampling on iterative applications. Folding results closely resemble the performance data gathered from fine grain sampling with an absolute mean difference less than 5% without overhead of fine grain.
Keywords :
iterative methods; parallel processing; pattern clustering; performance evaluation; sampling methods; absolute mean difference; coarse grain sampling; computation burst clustering; fine grain sampling; folding mechanism; iterative application; parallel application computation phase; performance analysis tools; supercomputing resource; Algorithm design and analysis; Benchmark testing; Hardware; Measurement; Probes; Radiation detectors; clustering; instrumentation; performance analysis; sampling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Processing (ICPP), 2011 International Conference on
Conference_Location :
Taipei City
ISSN :
0190-3918
Print_ISBN :
978-1-4577-1336-1
Electronic_ISBN :
0190-3918
Type :
conf
DOI :
10.1109/ICPP.2011.35
Filename :
6047184
Link To Document :
بازگشت