DocumentCode
2303443
Title
Recovering from process failures in the time warp mechanism
Author
Agre, Jonathan R. ; Agrawal, Divyakant
Author_Institution
Rockwell Int. Corp., Thousand Oaks, CA, USA
fYear
1989
fDate
10-12 Oct 1989
Firstpage
53
Lastpage
61
Abstract
A recovery procedure for distributed systems using the time warp control mechanism is described. Time warp is an optimistic execution technique in which synchronization is achieved using rollback. The recovery procedure is a protocol that exploits the redundancy already available to implement process rollback in the time warp mechanism. Thus, the recovery protocol has little additional bookkeeping overhead, unlike many other recovery procedures. An informal proof of the correctness of the recovery procedure for a single process failure is presented. The protocol is extended so that it becomes resilient to multiple process failures.
Keywords
distributed processing; fault tolerant computing; network operating systems; redundancy; software reliability; synchronisation; system recovery; bookkeeping overhead; correctness; distributed systems; optimistic execution technique; process failures; protocol; recovery procedure; rollback; synchronization; time warp mechanism; Checkpointing; Computer science; Concurrency control; Control systems; Discrete event simulation; Operating systems; Protocols; Time factors; Transaction databases; Workstations
fLanguage
English
Publisher
IEEE
Conference_Titel
Reliable Distributed Systems, 1989., Proceedings of the Eighth Symposium on
Conference_Location
Seattle, WA
Print_ISBN
0-8186-1981-3
Type
conf
DOI
10.1109/RELDIS.1989.72748
Filename
72748