DocumentCode
1320947
Title
A Unified Method for Analyzing Mission Reliability for Fault Tolerant Computer Systems
Author
Bricker, Jacob L.
Author_Institution
Hughes Aircraft Company, Fullerton, Calif. 92634.
Issue
2
fYear
1973
fDate
6/1/1973 12:00:00 AM
Firstpage
72
Lastpage
77
Abstract
A reliability model is proposed and evaluated for a fault tolerant computer system which consists of multiple classes of modules and allows for degraded modes of performance. Each module of a given class has both an active and a passive hazard rate; constant hazard rates are assumed for active and dormant failures, and the given class may operate either in N Modular Redundancy (NMR: n + 1 out of 2n + 1 = N) or as a standby sparing system. The model allows for mission-phase changes at deterministic time points when the numbers of modules per class can be changed. The analysis proceeds by generalizing the notions of standby and NMR redundancy, which for N = 3 is TMR (Triple Modular Redundancy), into a concept called hybrid-degraded redundancy. The probabilistic evaluation of the unified redundancy concept is then developed to yield, for a given modular class, the joint distribution of success and the number of nonfailed modules from that class, at special times. With this information, a Markov chain analysis gives the reliability of an entire sequence of phases (mission profile).
Keywords
Aircraft; Distributed computing; Fault tolerant systems; Hazards; Jacobian matrices; Logic; NASA; Nuclear magnetic resonance; Redundancy; Space stations;
fLanguage
English
Journal_Title
Reliability, IEEE Transactions on
Publisher
ieee
ISSN
0018-9529
Type
jour
DOI
10.1109/TR.1973.5216037
Filename
5216037
Link To Document