• DocumentCode
    322512
  • Title

    Re-execution of distributed programs to detect bugs hidden by racing messages

  • Author

    Kilgore, Richard ; Chase, Craig

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Texas Univ., Austin, TX, USA
  • Volume
    1
  • fYear
    1997
  • fDate
    7-10 Jan 1997
  • Firstpage
    423
  • Abstract
    Finding errors in non-deterministic programs is complicated by the fact that an anomaly may occur during one program execution and not the next. Our objective is to provide a practical, yet powerful testing environment for distributed systems, using re-execution. We focus on re-executing the program under a strictly different message ordering. We show that messages are grouped into waves, such that any two messages from different waves must always be received in the same order. We provide an algorithm that produces a re-execution that maximizes the number of reordered pairs of message delivery events. We also provide an efficient online algorithm for detecting racing messages
  • Keywords
    distributed algorithms; hazards and race conditions; program debugging; program testing; distributed program re-execution; hidden bug detection; message delivery events; message ordering; message waves; non-deterministic programs; online algorithm; program error finding; program execution anomalies; racing message detection; reordered pairs; software testing environment; Application software; Computer bugs; Debugging; Differential equations; Distributed computing; Power engineering and energy; Power engineering computing; Programming profession; Software testing; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    System Sciences, 1997, Proceedings of the Thirtieth Hawaii International Conference on
  • Conference_Location
    Wailea, HI
  • ISSN
    1060-3425
  • Print_ISBN
    0-8186-7743-0
  • Type

    conf

  • DOI
    10.1109/HICSS.1997.667295
  • Filename
    667295