Title :
Independent node and process recovery in message passing distributed systems
Author :
Bhalla, Subhash ; Sreenivas, M.V.
Author_Institution :
Database Syst. Lab, Univ. of Aizu, Fukushima, Japan
Abstract :
Consistent recovery from process failures is an essential component of reliable distributed systems. Many existing recovery techniques use asynchronous message logging and checkpoints. Most of the present approaches depend on logged states of non-fail processes for recovery. A model of recovery based on the current active states of processes has been proposed. The algorithm considers recoverable state of the failed process and current states of the non-failed processes. Each process recovers to a consistent system state independently
Keywords :
distributed processing; fault tolerant computing; message passing; asynchronous message logging; checkpoints; independent node recovery; message passing distributed systems; process failures; process recovery; recovery techniques; reliable distributed systems; Checkpointing; Cities and towns; Computer crashes; Database systems; Fault tolerant systems; Laboratories; Message passing; Power system faults; Power system protection;
Conference_Titel :
High Performance Computing, 1996. Proceedings. 3rd International Conference on
Conference_Location :
Trivandrum
Print_ISBN :
0-8186-7557-8
DOI :
10.1109/HIPC.1996.565836