DocumentCode :
3469049
Title :
Reliable probabilistic checkpointing
Author :
Nam, Hyo-Chang ; Kim, Jong ; Hong, Sungje ; Lee, Sunggu
Author_Institution :
Dept. of Comput. Sci. & Eng., Pohang Inst. of Sci. & Technol., South Korea
fYear :
1999
fDate :
1999
Firstpage :
153
Lastpage :
160
Abstract :
Recently proposed probabilistic checkpointing has one drawback, naming aliasing. When analyzed, 64-bit signatures show negligible possibility of aliasing. But in practice, the shift-XOR signature generation function used with probabilistic checkpointing shows a high aliasing rate, which limits the practicality of probabilistic checkpointing. In this paper, two enhancements are considered to make probabilistic checkpointing more reliable. One is the signature generation function and the other is the recovery scheme. In the signature generation function part, we propose two signature generation functions: HALF for small block sizes (less than or equal to 256 bytes) and C-HALF(CRC combined HALF) for large block sizes (larger than 256 bytes), which have an aliasing probability similar to analytic results and small overhead. In the recovery scheme part, we propose a recovery scheme which ensures the safety of probabilistic checkpointing. To examine the correctness of previous checkpoints at recovery time, the proposed recovery scheme uses a spare node. We analyze the recovery scheme using a mathematical model. Also an optimal checkpoint interval is derived using the model
Keywords :
probability; software fault tolerance; system recovery; aliasing; checkpoints; mathematical model; reliable probabilistic checkpointing; shift-XOR signature generation function; system recovery scheme; Checkpointing; Computer science; Costs; Electronic switching systems; Failure analysis; Fault tolerance; Mathematical model; Protection; Reliability engineering; Safety;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Dependable Computing, 1999. Proceedings. 1999 Pacific Rim International Symposium on
Print_ISBN :
0-7695-0371-3
Type :
conf
DOI :
10.1109/PRDC.1999.816224
Filename :
816224
Link To Document :
بازگشت