• DocumentCode
    2960080
  • Title

    An Efficient Framework for Multi-dimensional Tuning of High Performance Computing Applications

  • Author

    Cong, Guojing ; Wen, Huifang ; Chung, I-Hsin ; Klepacki, David ; Murata, Hiroki ; Negishi, Yasushi

  • Author_Institution
    IBM TJ Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    2012
  • fDate
    21-25 May 2012
  • Firstpage
    1376
  • Lastpage
    1387
  • Abstract
    Deploying an application onto a target platform for high performance oftentimes demands manual tuning by experts. As machine architecture gets increasingly complex, tuning becomes even more challenging and calls for systematic approaches. In our earlier work we presented a prototype that combines efficiently expert knowledge, static analysis, and runtime observation for bottleneck detection, and employs refactoring and compiler feedback for mitigation. In this study, we develop a software tool that facilitates emph{fast} searching of bottlenecks and effective mitigation of problems from major dimensions of computing (e.g., computation, communication, and I/O). The impact of our approach is demonstrated by the tuning of the LBMHD code and a Poisson solver code, representing traditional scientific codes, and a graph analysis code in UPC, representing emerging programming paradigms. In the experiments, our framework detects with a single run of the application intricate bottlenecks of memory access, I/O, and communication. Moreover, the automated solution implementation yields significant overall performance improvement on the target platforms. The improvement for LBMHD is up to 45%, and the speedup for the UPC code is up to 5. These results suggest that our approach is a concrete step towards systematic tuning of high performance computing applications.
  • Keywords
    Poisson distribution; computer architecture; graph theory; parallel machines; software tools; LBMHD code; Poisson solver code; UPC; compiler feedback; expert knowledge; graph analysis code; high performance computing; machine architecture; memory access; multidimensional tuning; refactoring; runtime observation; software tool; static analysis; Libraries; Measurement; Probes; Runtime; Software; Tuning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International
  • Conference_Location
    Shanghai
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4673-0975-2
  • Type

    conf

  • DOI
    10.1109/IPDPS.2012.124
  • Filename
    6267938