DocumentCode :
1383744
Title :
Formally verified on-line diagnosis
Author :
Walter, Chris J. ; Lincoln, Patrick ; Suri, Neeraj
Author_Institution :
WW Technol. Group, Ellicott City, MD, USA
Volume :
23
Issue :
11
fYear :
1997
fDate :
11/1/1997
Firstpage :
684
Lastpage :
721
Abstract :
A reconfigurable fault tolerant system achieves the attributes of dependability of operations through fault detection, fault isolation and reconfiguration, typically referred to as the FDIR paradigm. Fault diagnosis is a key component of this approach, requiring an accurate determination of the health and state of the system. An imprecise state assessment can lead to catastrophic failure due to an optimistic diagnosis, or conversely, result in underutilization of resources because of a pessimistic diagnosis. Differing from classical testing and other off-line diagnostic approaches, we develop procedures for maximal utilization of the system state information to provide for continual, on-line diagnosis and reconfiguration capabilities as an integral part of the system operations. Our diagnosis approach, unlike existing techniques, does not require administered testing to gather syndrome information but is based on monitoring the system message traffic among redundant system functions. We present comprehensive on-line diagnosis algorithms capable of handling a continuum of faults of varying severity at the node and link level. Not only are the proposed algorithms on-line in nature, but they are also themselves tolerant to faults in the diagnostic process. Formal analysis is presented for all proposed algorithms. These proofs both offer insight into the algorithm operations and facilitate a rigorous formal verification of the developed algorithms.
Keywords :
online operation; program diagnostics; program testing; program verification; reconfigurable architectures; software fault tolerance; FDIR paradigm; fault detection; fault diagnosis; fault isolation; formal verification; online diagnosis; operation dependability; optimistic diagnosis; pessimistic diagnosis; reconfigurable fault tolerant system; redundant system functions; system message traffic monitoring; system state information; testing; Algorithm design and analysis; Costs; Fault detection; Fault diagnosis; Fault tolerant systems; Formal verification; Helium; Monitoring; Resource management; System testing;
fLanguage :
English
Journal_Title :
Software Engineering, IEEE Transactions on
Publisher :
IEEE
ISSN :
0098-5589
Type :
jour
DOI :
10.1109/32.637385
Filename :
637385