Title :
Probabilistic evaluation of online checks in fault-tolerant multiprocessor systems
Author :
Nair, V.S.S. ; Hoskote, Yatin Vasant ; Abraham, Jacob A.
Author_Institution :
Dept. of Comput. Sci. & Eng., Southern Methodist Univ., Dallas, TX, USA
fDate :
5/1/1992 12:00:00 AM
Abstract :
The analysis of fault-tolerant multiprocessor systems that use concurrent error detection (CED) schemes is much more difficult than the analysis of conventional fault-tolerant architectures. Various analytical techniques have been proposed to evaluate CED schemes deterministically. However, these approaches are based on worst-case assumptions related to the failure of system components. Often, the evaluation results do not reflect the actual fault tolerance capabilities of the system. A probabilistic approach to evaluate the fault detecting and locating capabilities of online checks. in a system is developed. The various probabilities associated with the checking schemes are identified and used in the framework of the matrix-based model. Based on these probabilistic matrices, estimates for the fault tolerance capabilities of various systems are derived analytically
Keywords :
fault tolerant computing; multiprocessing systems; probability; concurrent error detection; fault detection; fault location; fault-tolerant multiprocessor systems; matrix-based model; online checks; probabilistic evaluation; probabilistic matrices; Algorithm design and analysis; Analytical models; Constraint theory; Error analysis; Fault detection; Fault tolerance; Fault tolerant systems; Jacobian matrices; Multiprocessing systems; Signal processing algorithms;
Journal_Title :
Computers, IEEE Transactions on