• DocumentCode
    3104167
  • Title

    A preliminary topological debugger for MPI programs

  • Author

    Huband, Simon ; McDonald, Chris

  • Author_Institution
    Dept. of Comput. Sci. & Software Eng., Western Australia Univ., Nedlands, WA, Australia
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    422
  • Lastpage
    429
  • Abstract
    Most parallel programs use regular topologies to support their computation. Since they define the relationship between processes, process topologies present an excellent opportunity for debugging. The primary benefit is that patterns of expected behaviour can be abstracted and identified, and unexpected behaviour reported. However, topology support is inadequate in may environments, including the popular Message Passing Interface (MPI). Programmers typically implement topology support themselves, increasing the possibility of introducing errors. Moreover, debugger support that exploit topological information is lacking. We have undertaken to develop a debugger that exploits topological information. This paper presents DEPICT (DEbugger of Parallel but Inconsistent Communication Traces), a (preliminary) topology-based debugger for MPI. Currently, DEPICT presents high-level visualisations of parallel program communication behaviour, where logically similar processes are clearly indicated in a manner that allows the programmer insight into overall program behaviour. To assist in understanding unexpected behaviour, DEPICT allows programmers to investigate the observed semantic differences between processes. In addition to its current facilities, DEPICT´s implementation details and underlying algorithms are also described
  • Keywords
    message passing; parallel programming; program debugging; program visualisation; DEPICT; DEbugger of Parallel but Inconsistent Communication Traces; MPI programs; Message Passing Interface; expected behaviour patterns; high-level visualisations; parallel programs; topological debugger; Computer science; Concurrent computing; Costs; Debugging; Parallel programming; Power engineering and energy; Programming profession; Software engineering; Supercomputers; Topology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing and the Grid, 2001. Proceedings. First IEEE/ACM International Symposium on
  • Conference_Location
    Brisbane, Qld.
  • Print_ISBN
    0-7695-1010-8
  • Type

    conf

  • DOI
    10.1109/CCGRID.2001.923222
  • Filename
    923222