Title :
CYPRESS: Combining Static and Dynamic Analysis for Top-Down Communication Trace Compression
Author :
Jidong Zhai ; Jianfei Hu ; Xiongchao Tang ; Xiaosong Ma ; Wenguang Chen
Author_Institution :
Tsinghua Univ., Beijing, China
Abstract :
Communication traces are increasingly important, both for the performance analysis and optimization of parallel applications and for designing next-generation HPC systems. Meanwhile, problem sizes and execution scales on supercomputers keep growing, producing prohibitive volumes of communication traces. Existing dynamic compression methods reduce trace size, but they introduce large compression overhead as the job scale grows. We propose a hybrid static-dynamic method that leverages information acquired from static analysis to facilitate more effective and efficient dynamic trace compression. Our proposed scheme, Cypress, extracts a program communication structure tree at compile time using inter-procedural analysis. This tree naturally captures crucial iterative computing features such as the loop structure, allowing subsequent runtime compression to "fill in" event details into the known communication template in a "top-down" manner. Results show that Cypress reduces intra-process and inter-process compression overhead by up to 5× and 9×, respectively, over state-of-the-art dynamic methods, while introducing only very low compile-time overhead.
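The "top-down" idea in the abstract can be illustrated with a minimal sketch. This is not the Cypress implementation: the class names (`CallSite`, `Loop`), the helper `run_loop`, and the `(peer rank, bytes)` parameter tuple are all hypothetical, and the tree that Cypress derives at compile time is written by hand here. The sketch only shows how a template known in advance lets runtime events be merged into counters instead of being appended to a flat trace.

```python
from dataclasses import dataclass, field


@dataclass
class CallSite:
    """Leaf of the template: one communication call site (hypothetical)."""
    name: str
    records: list = field(default_factory=list)  # [[params, repeat_count], ...]

    def record(self, params):
        # Merge with the previous record when the parameters repeat;
        # otherwise start a new record with a count of 1.
        if self.records and self.records[-1][0] == params:
            self.records[-1][1] += 1
        else:
            self.records.append([params, 1])


@dataclass
class Loop:
    """Inner node of the template: a loop whose body holds call sites."""
    children: list
    iterations: int = 0


def run_loop(loop, n, params_for):
    """Simulate n iterations, filling event details into the template."""
    for i in range(n):
        loop.iterations += 1
        for child in loop.children:
            child.record(params_for(child.name, i))


# Hand-written template standing in for the statically extracted tree.
send = CallSite("MPI_Send")
recv = CallSite("MPI_Recv")
template = Loop([send, recv])

# Every iteration communicates with the same peer and message size.
run_loop(template, 1000, lambda name, i: (1, 4096))  # (peer rank, bytes)
```

In this toy run, 2000 events collapse into one record per call site plus a single loop counter, because each event is matched against a known position in the template rather than searched for as a repeating pattern in a growing event sequence.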
Keywords :
parallel machines; program diagnostics; software performance evaluation; trees (mathematics); CYPRESS; communication template; compile time; dynamic analysis; dynamic compression methods; dynamic trace compression; execution scale; hybrid static-dynamic method; interprocedural analysis; interprocess compression overhead; intraprocess compression overhead; iterative computing features; loop structure; next-generation HPC systems; parallel application performance analysis; parallel application performance optimization; program communication structure; static analysis; supercomputers; top-down communication trace compression; Algorithm design and analysis; Asynchronous communication; Data structures; Educational institutions; Libraries; Performance analysis; Runtime; High Performance Computing; Message Passing; Performance Analysis; Trace Compression;
Conference_Title :
SC14: International Conference for High Performance Computing, Networking, Storage and Analysis
Conference_Location :
New Orleans, LA
Print_ISBN :
978-1-4799-5499-5