DocumentCode :
1974459
Title :
DPCP (Discard Past Consider Present)-a novel approach to adaptive fault detection in distributed systems
Author :
Sotoma, Irineu ; Madeira, Edmundo Roberto Mauro
Author_Institution :
Inst. of Comput., UNICAMP, Brazil
fYear :
2001
fDate :
2001
Firstpage :
76
Lastpage :
82
Abstract :
Fault detection is a fundamental issue for fault tolerance in distributed systems. The paper presents the DPCP (Discard Past Consider Present) approach, which discards the previous elapsed times of fault detection messages and considers only the current one. In this way, DPCP enables fast, accurate, and scalable adaptive fault monitoring for asynchronous distributed systems. Scalability comes from the minimum-time unit parameter, which controls the minimum frequency of the fault monitoring messages. The speed and accuracy of fault monitoring come from changing the timeout and monitoring interval values as soon as the system workload and the minimum-time unit allow. DPCP experiments on ACE+TAO were carried out to observe DPCP behavior under changing network workloads.
Keywords :
distributed object management; fault tolerant computing; monitoring; DPCP approach; Discard Past Consider Present approach; adaptive fault detection; asynchronous distributed systems; fault monitoring messages; fault tolerance; interval values; minimum-time unit parameter; network workloads; scalable adaptive fault monitoring; timeout; Conferences; Distributed computing; Fault detection
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Distributed Computing Systems, 2001. FTDCS 2001. Proceedings. The Eighth IEEE Workshop on Future Trends of
Conference_Location :
Bologna
Print_ISBN :
0-7695-1384-0
Type :
conf
DOI :
10.1109/FTDCS.2001.969625
Filename :
969625