• DocumentCode
    2303443
  • Title

    Recovering from process failures in the time warp mechanism

  • Author

    Agre, Jonathan R. ; Agrawal, Divyakant

  • Author_Institution
    Rockwell Int. Corp., Thousand Oaks, CA, USA
  • fYear
    1989
  • fDate
    10-12 Oct 1989
  • Firstpage
    53
  • Lastpage
    61
  • Abstract
    A recovery procedure for distributed systems using the time warp control mechanism is described. Time warp is an optimistic execution technique in which synchronization is achieved using rollback. The recovery procedure is a protocol that exploits the redundancy already available to implement process rollback in the time warp mechanism. Thus, the recovery protocol has little additional bookkeeping overhead, unlike many other recovery procedures. An informal proof of the correctness of the recovery procedure for a single process failure is presented. The protocol is extended so that it becomes resilient to multiple process failures
  • Keywords
    distributed processing; fault tolerant computing; network operating systems; redundancy; software reliability; synchronisation; system recovery; bookkeeping overhead; correctness; distributed systems; optimistic execution technique; process failures; protocol; recovery procedure; redundancy; rollback; synchronization; time warp mechanism; Checkpointing; Computer science; Concurrency control; Control systems; Discrete event simulation; Operating systems; Protocols; Time factors; Transaction databases; Workstations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Reliable Distributed Systems, 1989., Proceedings of the Eighth Symposium on
  • Conference_Location
    Seattle, WA
  • Print_ISBN
    0-8186-1981-3
  • Type

    conf

  • DOI
    10.1109/RELDIS.1989.72748
  • Filename
    72748