DocumentCode
3104167
Title
A preliminary topological debugger for MPI programs
Author
Huband, Simon ; McDonald, Chris
Author_Institution
Dept. of Comput. Sci. & Software Eng., Western Australia Univ., Nedlands, WA, Australia
fYear
2001
fDate
2001
Firstpage
422
Lastpage
429
Abstract
Most parallel programs use regular topologies to support their computation. Since they define the relationship between processes, process topologies present an excellent opportunity for debugging. The primary benefit is that patterns of expected behaviour can be abstracted and identified, and unexpected behaviour reported. However, topology support is inadequate in may environments, including the popular Message Passing Interface (MPI). Programmers typically implement topology support themselves, increasing the possibility of introducing errors. Moreover, debugger support that exploit topological information is lacking. We have undertaken to develop a debugger that exploits topological information. This paper presents DEPICT (DEbugger of Parallel but Inconsistent Communication Traces), a (preliminary) topology-based debugger for MPI. Currently, DEPICT presents high-level visualisations of parallel program communication behaviour, where logically similar processes are clearly indicated in a manner that allows the programmer insight into overall program behaviour. To assist in understanding unexpected behaviour, DEPICT allows programmers to investigate the observed semantic differences between processes. In addition to its current facilities, DEPICT´s implementation details and underlying algorithms are also described
Keywords
message passing; parallel programming; program debugging; program visualisation; DEPICT; DEbugger of Parallel but Inconsistent Communication Traces; MPI programs; Message Passing Interface; expected behaviour patterns; high-level visualisations; parallel programs; topological debugger; Computer science; Concurrent computing; Costs; Debugging; Parallel programming; Power engineering and energy; Programming profession; Software engineering; Supercomputers; Topology;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing and the Grid, 2001. Proceedings. First IEEE/ACM International Symposium on
Conference_Location
Brisbane, Qld.
Print_ISBN
0-7695-1010-8
Type
conf
DOI
10.1109/CCGRID.2001.923222
Filename
923222
Link To Document