Title :
Experimental evaluation of failure-detection schemes in real-time communication networks
Author :
Seungjae Han ; Shin, K.G.
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA
Abstract :
An effective failure-detection scheme is essential for reliable communication services. Most computer network rely on behavior-based detection schemes: each node uses heartbeats to detect the failure of its neighbor nodes, and the transport protocol (like TCP) achieves reliable communication by acknowledgment/retransmission. In this paper, we experimentally evaluate the effectiveness of such behavior-based detection schemes in real-time communication. Specifically, we measure and analyze the coverage and latency of two failure-detection schemes-neighbor detection and end-to-end detection-through fault-injection experiments. The experimental results have shown that a significant portion of failures can be detected very quickly by the neighbor detection scheme, while the end-to-end detection scheme uncovers the remaining failures with larger detection latencies.
Keywords :
computer network reliability; failure analysis; message passing; real-time systems; acknowledgment/retransmission; behavior-based detection schemes; end-to-end detection; failure-detection schemes; fault-injection experiments; heartbeats; latency; neighbor detection; real-time communication networks; reliable communication services; transport protocol; Application software; Communication networks; Computer network reliability; Computer networks; Control systems; Delay; Fault detection; Hardware; Intelligent networks; Telecommunication network reliability;
Conference_Titel :
Fault-Tolerant Computing, 1997. FTCS-27. Digest of Papers., Twenty-Seventh Annual International Symposium on
Conference_Location :
Seattle, WA, USA
Print_ISBN :
0-8186-7831-3
DOI :
10.1109/FTCS.1997.614085