Title :
A non-blocking recovery algorithm for causal message logging
Author :
Mitchell, J. Roger ; Garg, Vijay K.
Author_Institution :
Dept. of Electr. & Comput. Eng., Texas Univ., Austin, TX, USA
Abstract :
In the recovery of failed processes in a distributed program, causal logging schemes offer several benefits. These benefits include no rollback of unfailed processes and simple approaches to output commit. Unfortunately, previous approaches to the recovery of multiple simultaneous failures require that the distributed execution be blocked or that recovering processes coordinate. The latter requires assumptions which are not satisfactory. In this paper we present a solution that has neither of these drawbacks
Keywords :
distributed algorithms; distributed programming; message passing; software fault tolerance; system recovery; causal message logging; distributed program; failed process recovery; nonblocking recovery algorithm; output commit; rollback; Computer crashes; History; Laboratories; Optimization methods;
Conference_Titel :
Reliable Distributed Systems, 1998. Proceedings. Seventeenth IEEE Symposium on
Conference_Location :
West Lafayette, IN
Print_ISBN :
0-8186-9218-9
DOI :
10.1109/RELDIS.1998.740468