DocumentCode :
122714
Title :
Real-time scheduling under fault bursts with multiple recovery strategy
Author :
Haque, Mohammad Ariful ; Aydin, Hakan ; Dakai Zhu
Author_Institution :
Dept. of Comput. Sci., George Mason Univ., Fairfax, VA, USA
fYear :
2014
fDate :
15-17 April 2014
Firstpage :
63
Lastpage :
74
Abstract :
In this paper, we consider the feasibility problem of a set of real-time jobs which may be subject to a fault burst during execution. A fault burst represents a time interval during which multiple jobs may incur faults; hence multiple recoveries may be needed. We show that determining the feasibility of a real-time system, which may be subject to a fault burst that may last at most Δ time units, is an NP-Hard problem even when the exact position of the fault burst is known a priori. However, in a practical system, the fault burst may occur at any arbitrary and unpredictable time. We develop feasibility analysis by assuming multiple recovery strategy where, in addition to the job at the end of which the fault is detected, all preempted tasks are also re-executed. We formally characterize the overhead that a scheduler incurs due to a fault burst and present a generic recovery strategy, called Δ-idling, that is shown to minimize the worst-case overhead for any priority-driven scheduling algorithm. Next, we analyze periodic task systems. We show that the preemptive EDF policy, when coupled with Δ-idling, provides the highest possible utilization bound ½ (1 - Δ over Pmin), where Pmin is the smallest task period. We also present an empirical evaluation of the EDF policy with Δ-idling over synthetically generated task sets, and show that it offers a clear improvement over the naive EDF policy that triggers the recovery tasks as soon as an error is detected.
Keywords :
computational complexity; fault diagnosis; processor scheduling; real-time systems; system recovery; Δ-idling; NP-Hard problem; fault burst; fault detection; generic recovery strategy; multiple recovery strategy; naive EDF policy; periodic task systems; priority-driven scheduling algorithm; real-time jobs; real-time scheduling; real-time system; scheduler overhead; utilization bound; worst-case overhead; Circuit faults; Fault detection; Real-time systems; Schedules; Scheduling algorithms; Transient analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Real-Time and Embedded Technology and Applications Symposium (RTAS), 2014 IEEE 20th
Conference_Location :
Berlin
ISSN :
1080-1812
Print_ISBN :
978-1-4799-4691-4
Type :
conf
DOI :
10.1109/RTAS.2014.6925991
Filename :
6925991
Link To Document :
بازگشت