Title :
Adaptive fault tolerance and graceful degradation under dynamic hard real-time scheduling
Author :
González, Oscar ; Shrikumar, H. ; Stankovic, John A. ; Ramamritham, Krithi
Author_Institution :
Dept. of Comput. & Inf. Sci., Massachusetts Univ., Amherst, MA, USA
Abstract :
Static redundancy allocation is inappropriate in hard real-time systems that operate in variable and dynamic environments, (e.g., radar tracking, avionics). Adaptive fault tolerance (AFT) can assure adequate reliability of critical modules, under temporal and resource constraints, by allocating just as much redundancy to less critical modules as can be afforded thus gracefully reducing their resource requirement. We propose a mechanism for supporting adaptive fault tolerance in a real-time system. Adaptation is achieved by choosing a suitable redundancy strategy for a dynamically arriving computation to assure required reliability and to maximize the potential for fault tolerance while ensuring that deadlines are met. The proposed approach is evaluated using a real-life workload simulating radar tracking software in AWACS early warning aircraft. The results demonstrate that our technique outperforms static fault tolerance strategies in terms of tasks meeting their timing constraints. Further, we show that the gain in this timing-centric performance metric does not reduce the fault tolerance of the executing task below a predefined minimum level. Overall, the evaluation indicates that the proposed ideas result in a system that dynamically provides QoS guarantees along the fault-tolerance dimension.
Keywords :
aircraft computers; fault tolerant computing; performance evaluation; radar computing; radar tracking; real-time systems; scheduling; telecommunication computing; timing; AWACS; QoS guarantees; adaptive fault tolerance; dynamic hard real-time scheduling; early warning aircraft; graceful degradation; quality of service; radar tracking software; redundancy strategy; resource constraints; static redundancy allocation; temporal constraints; timing constraints; timing-centric performance metric; variable environments; Aerospace electronics; Aircraft; Computational modeling; Degradation; Fault tolerance; Fault tolerant systems; Radar tracking; Real time systems; Redundancy; Resource management;
Conference_Titel :
Real-Time Systems Symposium, 1997. Proceedings., The 18th IEEE
Conference_Location :
San Francisco, CA, USA
Print_ISBN :
0-8186-6600-5
DOI :
10.1109/REAL.1997.641271