• DocumentCode
    3335445
  • Title

    Completely asynchronous optimistic recovery with minimal rollbacks

  • Author

    Smith, S.W. ; Johnson, D.B. ; Tygar, J.D.

  • Author_Institution
    Dept. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • fYear
    1995
  • fDate
    27-30 June 1995
  • Firstpage
    361
  • Lastpage
    370
  • Abstract
    Consider the problem of transparently recovering an asynchronous distributed computation when one or more processes fail. Basing rollback recovery on optimistic message logging and replay is desirable for several reasons, including not requiring synchronization between processes during failure-free operation. However previous optimistic rollback recovery protocols either have required synchronization during recovery, or have permitted a failure at one process to potentially trigger an exponential number of process rollbacks. We present an optimistic rollback recovery protocol that provides completely asynchronous recovery, while also reducing the number of times a process must roll back in response to a failure to at most one. This protocol is based on comparing timestamp vectors across multiple levels of partial order time.<>
  • Keywords
    distributed processing; protocols; software fault tolerance; system recovery; asynchronous distributed computation; completely asynchronous optimistic recovery; failure-free operation; minimal rollbacks; optimistic message logging; optimistic rollback recovery protocols; partial order time; process failure; replay; rollback recovery; timestamp vectors; transparent recovery; Computer science; Contracts; Distributed computing; Government; Postal services; Protocols;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fault-Tolerant Computing, 1995. FTCS-25. Digest of Papers., Twenty-Fifth International Symposium on
  • Conference_Location
    Pasadena, CA, USA
  • Print_ISBN
    0-8186-7079-7
  • Type

    conf

  • DOI
    10.1109/FTCS.1995.466963
  • Filename
    466963