Title :
Synchronous Consensus for dependent process failures
Author :
Junqueira, Flavio P. ; Marzullo, Keith
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of California San Diego, La Jolla, CA, USA
Abstract :
We present a new abstraction to replace the t of n assumption used in designing fault-tolerant algorithms. This abstraction models dependent process failures yet it is as simple to use as the t of n assumption. To illustrate this abstraction, we consider Consensus for synchronous systems with both crash and arbitrary process failures. By considering failure correlations, we are able to reduce latency and enable the solution of Consensus for system configurations in which it is not possible when forced to use algorithms designed under the t of n assumption. We show that, in general, the number of rounds required in the worst case when assuming crash failures is different from the number of rounds required when assuming arbitrary failures. This is in contrast with the traditional result under the t of n assumption.
Keywords :
distributed algorithms; fault tolerant computing; synchronisation; system recovery; distributed algorithm; failure correlation; fault-tolerant algorithm; process failure; synchronous consensus; synchronous system; system crash; Algorithm design and analysis; Computer crashes; Computer science; Delay; Design engineering; Distributed algorithms; Drives; Failure analysis; Fault tolerance; Fault tolerant systems;
Conference_Titel :
Distributed Computing Systems, 2003. Proceedings. 23rd International Conference on
Print_ISBN :
0-7695-1920-2
DOI :
10.1109/ICDCS.2003.1203476