DocumentCode :
1956361
Title :
Real-Time Distributed Discrete-Event Execution with Fault Tolerance
Author :
Feng, Thomas Huining ; Lee, Edward A.
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., California Univ., Berkeley, CA
fYear :
2008
fDate :
22-24 April 2008
Firstpage :
205
Lastpage :
214
Abstract :
We build on PTIDES, a programming model for distributed embedded systems that uses discrete-event (DE) models as program specifications. PTIDES improves on distributed DE execution by allowing more concurrent event processing without backtracking. This paper discusses the general execution strategy for PTIDES, and provides two feasible implementations. This execution strategy is then extended with tolerance for hardware errors. We take a program transformation approach to automatically enhance DE models with incremental checkpointing and state recovery functionality. Our fault tolerance mechanism is lightweight and has low overhead. It requires very little human intervention. We incorporate this mechanism into PTIDES for efficient execution of fault-tolerant real-time distributed DE systems.
Keywords :
checkpointing; distributed processing; embedded systems; formal specification; software fault tolerance; PTIDES; concurrent event processing; distributed embedded systems; fault tolerance; incremental checkpointing; program specifications; program transformation approach; real-time distributed discrete-event execution; Application software; Checkpointing; Concurrent computing; Distributed computing; Embedded software; Embedded system; Fault tolerance; Fault tolerant systems; Hardware; Real time systems; discrete event; distributed system; fault tolerance; real time;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Real-Time and Embedded Technology and Applications Symposium, 2008. RTAS '08. IEEE
Conference_Location :
St. Louis, MO
ISSN :
1545-3421
Print_ISBN :
978-0-7695-3146-5
Type :
conf
DOI :
10.1109/RTAS.2008.22
Filename :
4550793