Title :
Identifying computational phases from inter-process communication traces of HPC applications
Author :
Alawneh, Luay ; Hamou-Lhadj, Abdelwahab
Author_Institution :
Dept. of Electr. & Comput. Eng., Concordia Univ., Montreal, QC, Canada
Abstract :
Understanding the behaviour of High Performance Computing (HPC) systems is a challenging task due to the large number of processes they involve as well as the complex interactions among these processes. In this paper, we present a novel approach that aims to simplify the analysis of large execution traces generated from HPC applications. We achieve this through a technique that allows semiautomatic extraction of execution phases from large traces. These phases, which characterize the main computations of the traced scenario, can be used by software engineers to browse the content of a trace at different levels of abstraction. Our approach is based on the application of information theory principles to the analysis of sequences of communication patterns found in HPC traces. The results of the proposed approach when applied to traces of a large HPC industrial system demonstrate its effectiveness in identifying the main program phases and their corresponding sub-phases.
Keywords :
distributed processing; information theory; HPC applications; HPC industrial system; communication pattern sequence; computational phase identification; execution phase semiautomatic extraction; execution trace analysis; high performance computing systems; information theory principles; interprocess communication traces; program phases; Algorithm design and analysis; Data models; Entropy; Equations; Message passing; Phase detection; Topology; dynamic analysis; execution phases; high performance computing systems; inter-process communication traces; program comprehension;
Conference_Title :
Program Comprehension (ICPC), 2012 IEEE 20th International Conference on
Conference_Location :
Passau
Print_ISBN :
978-1-4673-1213-4
ISSN :
1092-8138
DOI :
10.1109/ICPC.2012.6240481