DocumentCode :
1352239
Title :
High-availability computer systems
Author :
Gray, Jim ; Siewiorek, Daniel P.
Author_Institution :
Digital Equipment Corp., San Francisco, CA, USA
Volume :
24
Issue :
9
fYear :
1991
Firstpage :
39
Lastpage :
48
Abstract :
The techniques used to build highly available computer systems are sketched. Historical background is provided, and terminology is defined. Empirical experience with computer failure is briefly discussed. Device improvements that have greatly increased the reliability of digital electronics are identified. Fault-tolerant design concepts and approaches to fault-tolerant hardware are outlined. The role of repair and maintenance and of design-fault tolerance is discussed. Software repair is considered. The use of pairs of computer systems at separate locations to guard against unscheduled outages due to outside sources (communication or power failures, earthquakes, etc.) is addressed.
Keywords :
fault tolerant computing; computer failure; design-fault tolerance; digital electronics; fault-tolerant hardware; highly available computer systems; maintenance; repair; unscheduled outages; Application software; Art; Availability; Electron tubes; Fault detection; Fault tolerant systems; Hardware; Mission critical systems; Relays; Testing;
fLanguage :
English
Journal_Title :
Computer
Publisher :
IEEE
ISSN :
0018-9162
Type :
jour
DOI :
10.1109/2.84898
Filename :
84898