DocumentCode
2568569
Title
An algorithm based fault tolerance technique for safety-critical applications
Author
Smith, D. Todd ; DeLong, Todd A. ; Johnson, Barry W. ; Profeta, Joseph A., III
Author_Institution
Virginia Univ., Charlottesville, VA, USA
fYear
1997
fDate
13-16 Jan 1997
Firstpage
278
Lastpage
285
Abstract
The design of safety critical systems is a difficult and time consuming task. The traditional design of safety critical systems involves the use of proprietary hardware and software. The full custom design approach is becoming unacceptable because performing a complete redesign for each different class of control applications is unacceptable. Additionally, traditional safety critical systems are designed based on locating all faults in the system which can affect safety via diagnostics. The increasing complexity of the control algorithms which must be processed in a safety critical fashion makes the fault space of the system increase in size to the point where the design of diagnostics becomes quite difficult and time consuming. One promising methodology which can assist in reducing the design effort associated with safety critical systems while using commercial off-the-shelf (COTS) components is algorithm-based fault tolerance (ABFT). Unfortunately, existing ABFT techniques are currently not suitable for safety critical applications. A new safety critical ABFT (SC-ABFT) technique is presented. The checking scheme for the SC-ABFT method is derived based on verifying the correctness of a given control application as it is being evaluated. Also, the probability of detecting a safety-critical error can made as close to 1.0 as desired by varying certain parameters
Keywords
digital control; fault tolerant computing; reliability; safety systems; safety-critical software; software fault tolerance; software reliability; algorithm-based fault tolerance technique; commercial off-the-shelf components; control algorithm complexity; diagnostics; hardware; safety-critical applications; safety-critical error detection; software; Algorithm design and analysis; Application software; Control systems; Fault tolerance; Fault tolerant systems; Hardware; Signal design; Size control; Software safety; Switches;
fLanguage
English
Publisher
ieee
Conference_Titel
Reliability and Maintainability Symposium. 1997 Proceedings, Annual
Conference_Location
Philadelphia, PA
ISSN
0149-144X
Print_ISBN
0-7803-3783-2
Type
conf
DOI
10.1109/RAMS.1997.571720
Filename
571720
Link To Document