DocumentCode :
3019068
Title :
A Versatile, Proactive Dependability Approach to Handling Unanticipated Events in Distributed Systems
Author :
Narasimhan, Priya ; Rajkumar, Raj ; Thaker, Gautam ; Lardieri, Patrick
Author_Institution :
Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
2005
fDate :
04-08 April 2005
Abstract :
The MEAD system that we are developing employs a synergistic combination of a reactive and a proactive fault-tolerance approach in order to address unanticipated events and hazards in real-time, fault-tolerant distributed systems. The reactive fault-tolerance approach involves active monitoring of the system to adapt the provided QoS and to allocate resources based on current conditions in the system. The proactive approach involves monitoring both the distributed applications and the network to seek pre-cursors to imminent failures, and then to trigger fault-recovery mechanisms in advance of the occurrence of the failure. The underlying ideas of the MEAD system have demonstrated initial promise through our enhanced capabilities to handle failures and unanticipated events, and to reduce jitter under faulty conditions.
Keywords :
distributed processing; fault tolerant computing; quality of service; resource allocation; system monitoring; system recovery; MEAD system; QoS; fault-recovery mechanisms; fault-tolerant distributed systems; jitter reduction; real-time systems; resources allocation; unanticipated event handling; Condition monitoring; Creep; Event detection; Fault tolerant systems; Hazards; Jitter; Programming profession; Real time systems; Resource management; Surges;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium, 2005. Proceedings. 19th IEEE International
Print_ISBN :
0-7695-2312-9
Type :
conf
DOI :
10.1109/IPDPS.2005.74
Filename :
1419977
Link To Document :
بازگشت