DocumentCode :
564967
Title :
Preventing state divergence in duplex systems using causal memory
Author :
Chitsaz, Behzad ; Razzazi, Mohammadreza
Author_Institution :
Dept. of Comput. Eng. & Inf. Technol., Amirkabir Univ. of Technol., Tehran, Iran
fYear :
2012
fDate :
21-25 May 2012
Firstpage :
257
Lastpage :
261
Abstract :
Replicated execution of distributed programs provides a means of masking hardware or software failures in a distributed system. Application level entities (processes, objects) are replicated to execute on distinct processors. Such replica entities communicate via message-passing. Non-determinism within the replicas could cause messages to be processed in non-identical order, producing a divergence of state. The replicas could thereafter produce inconsistent responses to identical messages and hence appear to be faulty. The partial-order model of distributed computations based on the happened-before relation like primary-backup approach has been criticized for allowing false causality between messages, the false causality causes unnecessary blocking processes and results in high time overhead for replicating entities. In this paper we use the concept of causal memories and multi version states to reduce the false causality between messages. We capture the read/write operations on the variables of each process to find out the dependencies between messages, and save some old values of variables to use in cases the read that operations may cause divergence in the states of replicas. The results of simulation show that this approach has lower execution time than the primary-backup approach.
Keywords :
distributed programming; message passing; replicated databases; software architecture; software fault tolerance; storage management; system recovery; application level entities; causal memory; distributed computations; distributed programs; duplex systems; false causality reduction; hardware failure masking; message-passing; nonidentical order; primary-backup approach; read-write operations; replicated execution; software failure masking; state divergence prevention; Computational modeling; Data structures; Delay; Fault tolerance; Fault tolerant systems; History; Memory management; Causal Memory; Replicated Distributed Systems; Replication Consistency;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
MIPRO, 2012 Proceedings of the 35th International Convention
Conference_Location :
Opatija
Print_ISBN :
978-1-4673-2577-6
Type :
conf
Filename :
6240652
Link To Document :
بازگشت