Title :
Modular Consensus Algorithms for the Crash-Recovery Model
Author :
Freiling, Felix C. ; Lambertz, Christian ; Majster-Cederbaum, Mila
Author_Institution :
Dept. of Comput. Sci., Univ. of Mannheim, Mannheim, Germany
Abstract :
In the crash-recovery failure model of asynchronous distributed systems, processes can temporarily stop executing steps and later restart their computation from a predefined local state. The crash-recovery model is much more realistic than the crash-stop failure model, in which processes are merely allowed to stop executing steps. The additional complexity is reflected in the multitude of assumptions and the technical complexity of the algorithms that have been developed for that model. We focus on the problem of consensus in the crash-recovery model, but instead of developing completely new algorithms from scratch, our approach aims at reusing existing crash-stop consensus algorithms in a modular way using the abstraction of failure detectors. As a result, we present three new consensus algorithms for the crash-recovery model under different types of assumptions.
Keywords :
distributed processing; system recovery; asynchronous distributed system; crash-recovery failure; crash-stop consensus; crash-stop failure; failure detector; modular consensus; technical complexity; Algorithm design and analysis; Application software; Computational modeling; Computer crashes; Computer science; Concurrent computing; Detectors; Distributed computing; Electronic mail; Fault tolerant systems; asynchronous systems; consensus; fault tolerance; process crash and recovery; stable storage;
Conference_Title :
Parallel and Distributed Computing, Applications and Technologies, 2009 International Conference on
Conference_Location :
Higashi Hiroshima
Print_ISBN :
978-0-7695-3914-0
DOI :
10.1109/PDCAT.2009.50