DocumentCode :
2819528
Title :
Design and analysis of an efficient algorithm for coordinated checkpointing in distributed systems
Author :
Cao, Jiannong ; Jia, Weijia ; Jia, Xiaohua ; Cheung, To-yat
Author_Institution :
Dept. of Comput. Sci., City Univ. of Hong Kong, Hong Kong
fYear :
1997
fDate :
19-21 Mar 1997
Firstpage :
261
Lastpage :
268
Abstract :
A synchronous checkpointing algorithm coordinates a set of processes in taking checkpoints in such a way that the set of local checkpoints always forms part of a consistent global system state. Whenever a process p requests to take a checkpoint, a set of processes, called the cohorts set of p, must be checked and some of them may also have to take their checkpoints in order to preserve system consistency. Although several synchronous checkpointing algorithms have been proposed in the literature, most of them do not address the performance issue. In this paper we propose an efficient distributed algorithm for synchronous checkpointing. Proof of correctness and analysis of efficiency of the algorithm are presented. It is shown that the algorithm has a better message and time complexity than the existing algorithms. The method proposed in this paper can also be applied to enhance the performance of rollback operation which always require synchronization of the inter-dependent processes
Keywords :
computational complexity; concurrency control; distributed algorithms; distributed processing; fault tolerant computing; system recovery; checkpoint; coordinated checkpointing; distributed algorithm; distributed systems; fault tolerance; global system state; message complexity; rollback operation; rollback recovery; time complexity; Algorithm design and analysis; Checkpointing; Computer science; Debugging; Distributed algorithms; Fault tolerant systems; Merging; Multicast algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Parallel and Distributed Computing, 1997. Proceedings
Conference_Location :
Shanghai
Print_ISBN :
0-8186-7876-3
Type :
conf
DOI :
10.1109/APDC.1997.574042
Filename :
574042
Link To Document :
بازگشت