DocumentCode
322512
Title
Re-execution of distributed programs to detect bugs hidden by racing messages
Author
Kilgore, Richard ; Chase, Craig
Author_Institution
Dept. of Electr. & Comput. Eng., Texas Univ., Austin, TX, USA
Volume
1
fYear
1997
fDate
7-10 Jan 1997
Firstpage
423
Abstract
Finding errors in non-deterministic programs is complicated by the fact that an anomaly may occur during one program execution and not the next. Our objective is to provide a practical, yet powerful testing environment for distributed systems, using re-execution. We focus on re-executing the program under a strictly different message ordering. We show that messages are grouped into waves, such that any two messages from different waves must always be received in the same order. We provide an algorithm that produces a re-execution that maximizes the number of reordered pairs of message delivery events. We also provide an efficient online algorithm for detecting racing messages
Keywords
distributed algorithms; hazards and race conditions; program debugging; program testing; distributed program re-execution; hidden bug detection; message delivery events; message ordering; message waves; non-deterministic programs; online algorithm; program error finding; program execution anomalies; racing message detection; reordered pairs; software testing environment; Application software; Computer bugs; Debugging; Differential equations; Distributed computing; Power engineering and energy; Power engineering computing; Programming profession; Software testing; System testing;
fLanguage
English
Publisher
ieee
Conference_Titel
System Sciences, 1997, Proceedings of the Thirtieth Hawaii International Conference on
Conference_Location
Wailea, HI
ISSN
1060-3425
Print_ISBN
0-8186-7743-0
Type
conf
DOI
10.1109/HICSS.1997.667295
Filename
667295
Link To Document