Title :
An improved schema of coordinated checkpointing protocol for distributed systems based on popular process
Author :
Abdelhafidi, Z. ; Djoudi, M. ; Yagoubi, M.B.
Author_Institution :
Comput. Sci. & Math. Lab., Laghouat, Algeria
Abstract :
In this paper, we propose an improved scheme of non-blocking checkpointing algorithm for distributed systems that minimizes the request number. It is based on piggybacking of dependency vectors not on request messages but on computation messages and replies. Here, a process can initiate checkpointing only if it is a popular process (a process that has dependency information percentage greater or equal to decision threshold). We compare our algorithm called NNB (New Non-Blocking) to CSNB protocol (Cao and Singhal non-blocking protocol) using simulation. To evaluate protocols performance, we choose request number, mutable checkpoints number and first phase duration as performance metrics.
Keywords :
checkpointing; distributed processing; fault tolerance; protocols; CSNB protocol; Cao and Singhal nonblocking protocol; NNB algorithm; computation messages; coordinated checkpointing protocol; mutable checkpoints number; new nonblocking algorithm; nonblocking checkpointing algorithm; performance metrics; piggybacking; popular process-based distributed systems; request messages; request number; Artificial neural networks; Checkpointing; Computational modeling; Measurement; Protocols; Radiation detectors; Vectors; coordinated checkpointing; distributed systems; fault tolerance; simulation;
Conference_Titel :
Innovations in Information Technology (IIT), 2012 International Conference on
Conference_Location :
Abu Dhabi
Print_ISBN :
978-1-4673-1100-7
DOI :
10.1109/INNOVATIONS.2012.6207769