DocumentCode :
1118180
Title :
Fault-Tolerant Computing: An Introduction and a Perspective
Author :
Kime, Charles R.
Author_Institution :
Department of Electrical and Computer Engineering, University of Wisconsin
Issue :
5
fYear :
1975
fDate :
5/1/1975
Firstpage :
457
Lastpage :
460
Abstract :
Fault-tolerant computing has been defined as "the ability to execute specified algorithms correctly regardless of hardware failures, total system flaws, or program fallacies" [1]. To the extent that a system falls short of meeting the requirements of this definition, it can be labeled a partially fault-tolerant system [2]. Thus the definition of fault-tolerant computing provides a standard against which to measure all systems having a degree of fault tolerance. In particular, one can classify systems according to: 1) the amount of manual intervention required in performing three basic functions, and 2) the class of faults covered by three basic functions involved in fault tolerance: system validation, fault diagnosis, and fault masking or recovery. The word "fault" here is used to inclusively describe "failures, flaws, and fallacies" in the original definition. The first function is involved in the design and production of the system hardware and software, while the last two functions are embodied in the system itself. Likewise, the first function is directed to handling faults arising from design and production errors, whereas the last two functions are aimed at faults due to random hardware failures.
Keywords :
Fault detection; Fault diagnosis; Fault tolerance; Fault tolerant systems; Hardware; Measurement standards; Production systems; Software systems; Solids; Testing;
fLanguage :
English
Journal_Title :
Computers, IEEE Transactions on
Publisher :
IEEE
ISSN :
0018-9340
Type :
jour
DOI :
10.1109/T-C.1975.224246
Filename :
1672839