Title :
On the effectiveness of distributed checkpoint algorithms for domino-free recovery
Author :
Zambonelli, Franco
Author_Institution :
Dipt. di Sci. dell´´Ingegneria, Modena Univ., Italy
Abstract :
The paper focuses on fault-tolerant distributed computations where processes can take local checkpoints without coordinating with each other. Several distributed online algorithms are presented which avoid rollback propagation by forcing additional local checkpoints in processes. The effectiveness of the algorithms is evaluated in several application examples, showing their limited capability of bounding the number of additional checkpoints
Keywords :
distributed algorithms; online operation; software fault tolerance; system recovery; additional checkpoint bounding; distributed checkpoint algorithms; distributed online algorithms; domino-free recovery; fault-tolerant distributed computations; local checkpoints; rollback propagation; Algorithm design and analysis; Checkpointing; Communication system control; Computer crashes; Distributed computing; Monitoring;
Conference_Titel :
High Performance Distributed Computing, 1998. Proceedings. The Seventh International Symposium on
Conference_Location :
Chicago, IL
Print_ISBN :
0-8186-8579-4
DOI :
10.1109/HPDC.1998.709964