Title :
Optimistic crash recovery without changing application messages
Author :
Venkatesan, S. ; Juang, Tony Tony-Ying ; Alagar, Sridhar
Author_Institution :
Comput. Sci. Program, Texas Univ., Richardson, TX, USA
fDate :
3/1/1997
Abstract :
We present an optimistic crash recovery technique that incurs no communication overhead during normal operation of the distributed system. Our technique does not append any information to the application messages, does not suffer from the domino effect, and requires each processor to roll back at most once during recovery. We present three distributed rollback algorithms, along with their complexities and correctness proofs. Their performance is measured through extensive simulations.
Keywords :
communication complexity; distributed algorithms; system recovery; application messages; communication overhead; complexities; correctness proofs; distributed rollback algorithms; distributed system; optimistic crash recovery technique; processor rollback; Application software; Checkpointing; Communication channels; Communication networks; Computer crashes; Computer science; Distributed algorithms; Distributed computing; History; Performance evaluation;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on