DocumentCode :
3423206
Title :
Checkpointing multicomputer applications
Author :
Li, Kai ; Naughton, J.F. ; Planck, J.S.
Author_Institution :
Dept. of Comput. Sci., Princeton Univ., NJ, USA
fYear :
1991
fDate :
30 Sep-2 Oct 1991
Firstpage :
2
Lastpage :
11
Abstract :
The authors present a checkpointing scheme that is transparent, imposes overhead only during checkpoints, requires minimal message logging, and allows for quick resumption of execution from a checkpointed image. Since checkpointing multicomputer applications poses requirements different from those posed by checkpointing general distributed systems, existing distributed checkpointing schemes are inadequate for multicomputer checkpointing. The proposed checkpointing scheme makes use of special properties of multicomputer interconnection networks to satisfy this set of requirements. The proposed algorithm is efficient both when taking checkpoints and when recovering from checkpointed images
Keywords :
fault tolerant computing; multiprocessor interconnection networks; performance evaluation; checkpointing scheme; minimal message logging; multicomputer applications; multicomputer interconnection networks; Application software; Checkpointing; Computer applications; Computer science; Distributed databases; Hardware; Power system interconnection; Resumes; Time sharing computer systems; Transaction databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Reliable Distributed Systems, 1991. Proceedings., Tenth Symposium on
Conference_Location :
Pisa
Print_ISBN :
0-8186-2260-1
Type :
conf
DOI :
10.1109/RELDIS.1991.145398
Filename :
145398
Link To Document :
بازگشت