Title :
Fail-aware failure detectors
Author :
Fetzer, Christof ; Cristian, Flaviu
Author_Institution :
Dept. of Comput. Sci. & Eng., California Univ., San Diego, La Jolla, CA, USA
Abstract :
In existing asynchronous distributed systems it is impossible to implement failure detectors which are perfect, i.e. they only suspect crashed processes and eventually suspect all crashed processes. Some recent research has however proposed that any “reasonable” failure detector for solving the election problem must be perfect. We address this problem by introducing two new classes of fail-aware failure detectors that are (1) implementable in existing asynchronous distributed systems, (2) not necessarily perfect, and (3) can be used to solve the election problem. In particular we show that there exists a fail-aware failure detector that allows to solve the election problem and which is strictly weaker than a perfect failure detector
Keywords :
distributed processing; fault tolerant computing; asynchronous distributed systems; crashed processes; election problem; fail-aware failure detectors; failure detectors; Broadcasting; Clocks; Computer crashes; Computer science; Detectors; Fault tolerant systems; Hardware; Nominations and elections; Safety; Workstations;
Conference_Titel :
Reliable Distributed Systems, 1996. Proceedings., 15th Symposium on
Conference_Location :
Nigara-on-the-Lake, Ont.
Print_ISBN :
0-8186-7481-4
DOI :
10.1109/RELDIS.1996.559722