DocumentCode
3104133
Title
Efficient tracing for on-the-fly space-time displays in a debugger for message passing programs
Author
Hood, Robert ; Matthews, Gregory
Author_Institution
Intelligent Syst. Technol. Branch, NASA Ames Res. Center, Moffett Field, CA, USA
fYear
2001
fDate
2001
Firstpage
394
Lastpage
403
Abstract
We describe the implementation of a practical mechanism for collecting and displaying trace information in a debugger for message passing programs. We introduce a trace format that is highly compressible while still providing information adequate for debugging purposes. We make the mechanism convenient for users to access by incorporating the trace collection in a set of wrappers for the MPI communication library. We implement several debugger operations that use the trace display: consistent stoplines, undo, and rollback. They all are implemented using controlled replay, which executes at full speed in target processes until the appropriate position in the computation is reached. They provide convenient mechanisms for getting to places in the execution where the full power of a state-based debugger can be brought to bear on isolating communication errors
Keywords
message passing; program debugging; software libraries; user interfaces; MPI communication library; communication errors; consistent stoplines; controlled replay; message passing programs; on-the-fly space-time displays; program debugging; rollback; trace format; trace information; undo; wrappers; Communication system control; Computer displays; Contracts; Data visualization; History; Libraries; Message passing; NASA; Software debugging; User interfaces;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing and the Grid, 2001. Proceedings. First IEEE/ACM International Symposium on
Conference_Location
Brisbane, Qld.
Print_ISBN
0-7695-1010-8
Type
conf
DOI
10.1109/CCGRID.2001.923219
Filename
923219
Link To Document