DocumentCode
796372
Title
Probabilistic evaluation of online checks in fault-tolerant multiprocessor systems
Author
Nair, V.S.S. ; Hoskote, Yatin Vasant ; Abraham, Jacob A.
Author_Institution
Dept. of Comput. Sci. & Eng., Southern Methodist Univ., Dallas, TX, USA
Volume
41
Issue
5
fYear
1992
fDate
5/1/1992 12:00:00 AM
Firstpage
532
Lastpage
541
Abstract
The analysis of fault-tolerant multiprocessor systems that use concurrent error detection (CED) schemes is much more difficult than the analysis of conventional fault-tolerant architectures. Various analytical techniques have been proposed to evaluate CED schemes deterministically. However, these approaches are based on worst-case assumptions related to the failure of system components. Often, the evaluation results do not reflect the actual fault tolerance capabilities of the system. A probabilistic approach is developed to evaluate the fault-detecting and fault-locating capabilities of online checks in a system. The various probabilities associated with the checking schemes are identified and used in the framework of the matrix-based model. Based on these probabilistic matrices, estimates for the fault tolerance capabilities of various systems are derived analytically.
Keywords
fault tolerant computing; multiprocessing systems; probability; concurrent error detection; fault detection; fault location; fault-tolerant multiprocessor systems; matrix-based model; online checks; probabilistic evaluation; probabilistic matrices; Algorithm design and analysis; Analytical models; Constraint theory; Error analysis; Fault detection; Fault tolerance; Fault tolerant systems; Jacobian matrices; Multiprocessing systems; Signal processing algorithms;
fLanguage
English
Journal_Title
Computers, IEEE Transactions on
Publisher
IEEE
ISSN
0018-9340
Type
jour
DOI
10.1109/12.142679
Filename
142679