• DocumentCode
    598573
  • Title

    MPI runtime error detection with MUST: Advances in deadlock detection

  • Author

    Hilbrich, Tobias ; Protze, Joachim ; Schulz, Markus ; de Supinski, Bronis R. ; Muller, Matthias S.

  • Author_Institution
    ZIH, Tech. Univ. Dresden, Dresden, Germany
  • fYear
    2012
  • fDate
    10-16 Nov. 2012
  • Firstpage
    1
  • Lastpage
    10
  • Abstract
    The widely used Message Passing Interface (MPI) is complex and rich. As a result, application developers require automated tools to avoid and to detect MPI programming errors. We present the Marmot Umpire Scalable Tool (MUST) that detects such errors with significantly increased scalability. We present improvements to our graph-based deadlock detection approach for MPI, which cover future MPI extensions. Our enhancements also check complex MPI constructs that no previous graph-based detection approach handled correctly. Finally, we present optimizations for the processing of MPI operations that reduce runtime deadlock detection overheads. Existing approaches often require O(p) analysis time per MPI operation, for p processes. We empirically observe that our improvements lead to sub-linear or better analysis time per operation for a wide range of real world applications.
  • Keywords
    application program interfaces; graph theory; message passing; optimisation; parallel programming; MPI programming errors; MPI runtime error detection; MUST; Marmot umpire scalable tool; automated tools; deadlock detection; graph based deadlock detection; message passing interface; optimizations; Analytical models; Mathematical model; Optimization; Runtime; Semantics; System recovery; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis (SC), 2012 International Conference for
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    2167-4329
  • Print_ISBN
    978-1-4673-0805-2
  • Type

    conf

  • DOI
    10.1109/SC.2012.79
  • Filename
    6468449