DocumentCode :
1219663
Title :
Location of a faulty module in a computing system
Author :
Lin, Tein-hsiang ; Shin, Kang G.
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA
Volume :
39
Issue :
2
fYear :
1990
fDate :
2/1/1990 12:00:00 AM
Firstpage :
182
Lastpage :
194
Abstract :
Considering the interplay between different phases of fault tolerance, a new problem of locating a faulty module in a computing system is formulated and solved. First, the probability of each module being faulty, or faulty probability, is calculated using the likelihood principle from the model parameters for fault detection, diagnostics, error propagation, and error detection. Then, based on the faulty probabilities and a given required diagnostic coverage, the order in which modules are to be diagnosed and the maximum time allotted to diagnose each module are determined by minimizing the average total diagnostic time. An example is presented and analyzed to answer the question of whether or not a system should delay the diagnosis upon detection of an error until more errors are detected.
Keywords :
fault tolerant computing; computing system; error detection; error propagation; fault tolerance; faulty module; likelihood principle; model parameters; probability; Circuit faults; Decision theory; Delay systems; Fault detection; Fault diagnosis; Fault location; Fault tolerant systems; Hardware; Helium; Probability
fLanguage :
English
Journal_Title :
Computers, IEEE Transactions on
Publisher :
IEEE
ISSN :
0018-9340
Type :
jour
DOI :
10.1109/12.45204
Filename :
45204