Title :
A Low-Overhead Cooperative Failure Detector
Author :
Jiaxi Liu;Zhibo Wu;Jinghui Lan;Jian Dong;Jin Wu;Jiaxin Yu
Author_Institution :
Sch. of Comput. Sci. &
Abstract :
Failure detectors are one of the fundamental components for ensuring the high availability of large scale distributed system. The increasing popularity and demand for the large scale distributed system came with an increase in the overhead and complexity of failure detection that posed a challenge obstructing further development. In order to solve the challenge, this paper proposes a new failure detector -- S-AFD which combines adaptive failure detection based on QoS (quality of service) and cooperative mechanism that share negative messages among different active nodes. It does not only reduce the detection overhead, but also adapt the various network conditions. Through analysis of experiments, it is shown that the performance of S-AFD has a clearly improvement compared with the traditional implementations of failure detectors.
Keywords :
"Detectors","Quality of service","Peer-to-peer computing","Measurement","Monitoring","Adaptive systems","Electronic mail"
Conference_Titel :
Instrumentation and Measurement, Computer, Communication and Control (IMCCC), 2015 Fifth International Conference on
DOI :
10.1109/IMCCC.2015.177