Title :
Optimal discrimination between transient and permanent faults
Author :
Pizza, M. ; Strigini, L. ; Bondavalli, A. ; di Giandomenico, F.
Author_Institution :
Centre for Software Reliability, City Univ., London, UK
Abstract :
An important practical problem in fault diagnosis is discriminating between permanent faults and transient faults. In many computer systems, the majority of errors are due to transient faults. Many heuristic methods have been used for discriminating between transient and permanent faults; however, we have found no previous work stating this decision problem in clear probabilistic terms. We present an optimal procedure for discriminating between transient and permanent faults, based on applying Bayesian inference to the observed events (correct and erroneous results). We describe how the assessed probability that a module is permanently faulty must vary with observed symptoms. We describe and demonstrate our proposed method on a simple application problem, building the appropriate equations and showing numerical examples. The method can be implemented as a run-time diagnosis algorithm at little computational cost; it can also be used to evaluate any heuristic diagnostic procedure by comparison
Keywords :
Bayes methods; fault diagnosis; fault tolerant computing; inference mechanisms; probability; uncertainty handling; Bayesian inference; computational cost; computer fault diagnosis; decision problem; errors; heuristic methods; numerical examples; optimal procedure; permanent faults; probability; run-time diagnosis algorithm; transient faults; Application software; Bayesian methods; Bonding; Computer errors; Costs; Equations; Fault diagnosis; Fault tolerant systems; Software reliability; Testing;
Conference_Titel :
High-Assurance Systems Engineering Symposium, 1998. Proceedings. Third IEEE International
Conference_Location :
Washington, DC
Print_ISBN :
0-8186-9221-9
DOI :
10.1109/HASE.1998.731615