DocumentCode :
1704588
Title :
Preventing useless checkpoints in distributed computations
Author :
Helary, Jean-Michel ; Mostefaoui, Achour ; Netzer, Robert H B ; Raynal, Michel
Author_Institution :
IRISA, Rennes, France
fYear :
1997
Firstpage :
183
Lastpage :
190
Abstract :
A useless checkpoint is a local checkpoint that cannot be part of a consistent global checkpoint. The paper addresses the following important problem. Given a set of processes that take (basic) local checkpoints in an independent and unknown way, the problem is to design a communication induced checkpointing protocol that directs processes to take additional local (forced) checkpoints to ensure that no local checkpoint is useless. A general and efficient protocol answering this problem is proposed. It is shown that several existing protocols that solve the same problem are particular instances of it. The design of this general protocol is motivated by the use of communication induced checkpointing protocols in “consistent global checkpoint” based distributed applications. Detection of stable or unstable properties, rollback recovery and determination of distributed breakpoints are examples of such applications
Keywords :
distributed processing; fault tolerant computing; performance evaluation; protocols; reliability; software fault tolerance; system recovery; basic local checkpoints; communication induced checkpointing protocol; communication induced checkpointing protocols; consistent global checkpoint based distributed applications; distributed breakpoints; distributed computations; efficient protocol; local checkpoint; local forced checkpoints; rollback recovery; unstable properties; useless checkpoints; Checkpointing; Communication system control; Computational modeling; Computer science; Degradation; Distributed computing; Process control; Protocols;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Reliable Distributed Systems, 1997. Proceedings., The Sixteenth Symposium on
Conference_Location :
Durham, NC
ISSN :
1060-9857
Print_ISBN :
0-8186-8177-2
Type :
conf
DOI :
10.1109/RELDIS.1997.632814
Filename :
632814
Link To Document :
بازگشت