Title :
An index-based checkpointing algorithm for autonomous distributed systems
Author :
Baldoni, Roberto ; Quaglia, Francesco ; Fornara, Paolo
Author_Institution :
Dipt. di Inf. e Sistemistica, Rome Univ., Italy
Abstract :
The paper presents an index-based checkpointing algorithm for distributed systems that aims to reduce the total number of checkpoints while ensuring that each checkpoint belongs to at least one consistent global checkpoint (or recovery line). The algorithm is based on an equivalence relation defined between pairs of successive checkpoints of a process, which in some cases allows the recovery line of the computation to advance without forcing checkpoints in other processes. The protocol shows good performance, especially in autonomous environments, where each process has no private information about other processes.
Keywords :
distributed processing; fault tolerant computing; reliability; software fault tolerance; system recovery; autonomous distributed systems; autonomous environments; consistent global checkpoint; equivalence relation; index based checkpointing algorithm; protocol; recovery line; successive checkpoints; Algorithm design and analysis; Checkpointing; Communication system control; Contracts; Distributed computing; Fault tolerant systems; Force control; Process design; Protocols; Remuneration
Conference_Title :
Reliable Distributed Systems, 1997. Proceedings., The Sixteenth Symposium on
Conference_Location :
Durham, NC
Print_ISBN :
0-8186-8177-2
DOI :
10.1109/RELDIS.1997.632793