Title :
Low-cost error containment and recovery for onboard guarded software upgrading and beyond
Author :
Tai, Ann T. ; Tso, Kam S. ; Alkalai, Leon ; Chau, Savio N. ; Sanders, William H.
Author_Institution :
IA Tech Inc., Los Angeles, CA, USA
fDate :
2/1/2002 12:00:00 AM
Abstract :
Message-driven confidence-driven (MDCD) error containment and recovery, a low-cost approach to mitigating the effect of software design faults in distributed embedded systems, is developed for onboard guarded software upgrading for deep-space missions. In this paper, we first describe and verify the MDCD algorithms in which we introduce the notion of "confidence-driven" to complement the "communication-induced" approach employed by a number of existing checkpointing protocols to achieve error containment and recovery efficiency. We then conduct a model-based analysis to show that the algorithms ensure low performance overhead. Finally, we discuss the advantages of the MDCD approach and its potential utility as a general-purpose, low-cost software fault tolerance technique for distributed embedded computing
Keywords :
aerospace computing; distributed programming; embedded systems; software cost estimation; software fault tolerance; software maintenance; software performance evaluation; system recovery; checkpointing protocols; communication-induced approach; deep-space missions; distributed embedded computing; efficiency; error containment; error recovery; global state consistency; global state recoverability; low-cost approach; message-driven confidence-driven algorithms; model-based analysis; onboard guarded software upgrading; performance overhead; software design faults; software fault tolerance; Algorithm design and analysis; Checkpointing; Embedded computing; Embedded software; Embedded system; Fault tolerance; Performance analysis; Protocols; Software design;
Journal_Title :
Computers, IEEE Transactions on