DocumentCode
3428103
Title
Novel views of performance data to analyze large-scale adaptive applications
Author
Bhatele, Abhinav ; Gamblin, Todd ; Isaacs, Katherine E. ; Gunney, B.T.N. ; Schulz, Markus ; Bremer, Peer-Timo ; Hamann, Bernd
Author_Institution
Center for Appl. Sci. Comput., Lawrence Livermore Nat. Lab., Livermore, CA, USA
fYear
2012
fDate
10-16 Nov. 2012
Firstpage
1
Lastpage
11
Abstract
Performance analysis of parallel scientific codes is becoming increasingly difficult due to the rapidly growing complexity of applications and architectures. Existing tools fall short in providing intuitive views that facilitate the process of performance debugging and tuning. In this paper, we extend recent ideas of projecting and visualizing performance data for faster, more intuitive analysis of applications. We collect detailed per-level and per-phase measurements for a dynamically load-balanced, structured AMR library and project per-core data collected in the hardware domain on to the application´s communication topology. We show how our projections and visualizations lead to a rapid diagnosis of and mitigation strategy for a previously elusive scaling bottleneck in the library that is hard to detect using conventional tools. Our new insights have resulted in a 22% performance improvement for a 65,536-core run of the AMR library on an IBM Blue Gene/P system.
Keywords
computer debugging; data analysis; data visualisation; mesh generation; multiprocessing systems; parallel processing; performance evaluation; IBM Blue Gene/P system; adaptive mesh refinement; application communication topology; large-scale adaptive applications; load-balanced AMR library; mitigation strategy; parallel scientific codes; per-core data; per-level measurements; per-phase measurements; performance analysis; performance data projection; performance data visualization; performance debugging process; performance improvement; performance tuning process; structured AMR library; Data visualization; Hardware; Libraries; Load management; Performance analysis; Program processors; Scalability;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis (SC), 2012 International Conference for
Conference_Location
Salt Lake City, UT
ISSN
2167-4329
Print_ISBN
978-1-4673-0805-2
Type
conf
DOI
10.1109/SC.2012.80
Filename
6468450
Link To Document