Title :
Roll-forward error recovery in embedded real-time systems
Author :
Xu, Jie ; Randell, Brian
Author_Institution :
Dept. of Comput. Sci., Newcastle upon Tyne Univ., UK
Abstract :
Roll-forward checkpointing schemes are developed in order to avoid rollback in the presence of independent faults and to increase the possibility that a task completes within a tight deadline. However, despite of the adoption of roll-forward recovery, these schemes are not necessarily appropriate for time-critical applications because interactions with the external environment and communications between processes must be deferred during checkpoint validation steps (typically, two checkpoint intervals) until the fault-free processors are identified. The deadlines on providing services may thus be violated. In this paper we present and discuss two alternative roll-forward recovery schemes, especially for time-critical and interaction-intensive applications, that deliver correct, timely results even when checkpoint validation is required
Keywords :
distributed processing; fault tolerant computing; multiprocessing systems; real-time systems; system recovery; checkpoint validation; checkpoint validation steps; checkpointing schemes; embedded real-time systems; fault-free processors; interaction-intensive applications; roll-forward error recovery; time-critical applications; Checkpointing; Delay; Error correction; Fault detection; Fault diagnosis; Fault tolerant systems; Real time systems; Redundancy; Time factors; Timing;
Conference_Titel :
Parallel and Distributed Systems, 1996. Proceedings., 1996 International Conference on
Conference_Location :
Tokyo
Print_ISBN :
0-8186-7267-6
DOI :
10.1109/ICPADS.1996.517589