Title :
Implementation and performance evaluation of an adaptable failure detector
Author :
Bertier, Marin ; Marin, Olivier ; Sens, Pierre
Author_Institution :
Lab. d´´Informatique, Paris VI Univ., France
Abstract :
Chandra and Toueg (1996) introduced the concept of unreliable failure detectors, They showed how, by adding these detectors to an asynchronous system, it is possible to solve the Consensus problem. In this paper, we propose a new implementation of a failure detector. This implementation is a variant of the heartbeat failure detector which is adaptable and can support scalable applications. In this implementation we dissociate two aspects: a basic estimation of the expected arrival date to provide a short detection time, and an adaptation of the quality of service according to application needs. The latter is based on two principles: an adaptation layer and a heuristic to adapt the sending period of "I am alive" messages.
Keywords :
distributed processing; fault tolerant computing; quality of service; system recovery; adaptable failure detector; adaptation layer; asynchronous system; expected arrival date estimation; heartbeat failure detector; heuristic; performance evaluation; quality of service; scalable applications; short detection time; Broadcasting; Computer crashes; Delay effects; Detectors; Fault detection; Fault tolerance; Fault tolerant systems; Heart beat; Quality of service; Stability;
Conference_Titel :
Dependable Systems and Networks, 2002. DSN 2002. Proceedings. International Conference on
Print_ISBN :
0-7695-1101-5
DOI :
10.1109/DSN.2002.1028920