DocumentCode :
3013835
Title :
An analysis of communication induced checkpointing
Author :
Alvisi, L. ; Elnozahy, E. ; Rao, S. ; Husain, S.A. ; de Mel, A.
Author_Institution :
Dept. of Comput. Sci., Texas Univ., Austin, TX, USA
fYear :
1999
fDate :
15-18 June 1999
Firstpage :
242
Lastpage :
249
Abstract :
Communication induced checkpointing (CIC) allows processes in a distributed computation to take independent checkpoints and to avoid the domino effect. This paper presents an analysis of CIC protocols based on a prototype implementation and validated simulations. Our result indicate that there is sufficient evidence to suspect that much of the conventional wisdom about these protocols is questionable.
Keywords :
distributed programming; protocols; system recovery; CIC; CIC protocols; distributed computation; independent checkpoints; Analytical models; Checkpointing; Computational modeling; Electrical capacitance tomography; Protocols; Prototypes; Scalability; Virtual prototyping;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fault-Tolerant Computing, 1999. Digest of Papers. Twenty-Ninth Annual International Symposium on
Conference_Location :
Madison, WI, USA
ISSN :
0731-3071
Print_ISBN :
0-7695-0213-X
Type :
conf
DOI :
10.1109/FTCS.1999.781058
Filename :
781058
Link To Document :
بازگشت