DocumentCode
1727457
Title
Availability requirement for fault management server
Author
Han, Jame J. ; Sun, Hairong ; Levendel, Haim
Author_Institution
High Availability & Reliability Technol. Center, Motorola, Elk Grove Village, IL, USA
fYear
2001
fDate
6/23/1905 12:00:00 AM
Firstpage
468
Lastpage
473
Abstract
In this paper, we examine the availability requirement for the fault management server in high-availability communication systems. According to our study, we find that the availability of the fault management server does not need to be 99.999% in order to guarantee a 99.999% system availability as long as the fail-safe ratio (the probability that the failure of the fault management server will not bring the system down) and the fault coverage ratio (the probability that the failure in the system can be detected and recovered by the fault management server) are sufficiently high. Tradeoffs can be made among the availability of the fault management server, the fail-safe ratio and the fault coverage ratio to optimize system availability. A cost-effective design for the fault management server is proposed in this paper
Keywords
software fault tolerance; system recovery; availability requirement; cost-effective design; fail-safe ratio; fault coverage ratio; fault management server; high-availability communication systems; system availability; Availability; Computer network management; Energy management; Fault detection; Memory management; Network servers; Power supplies; Power system management; Sun; Technology management;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Software and Applications Conference, 2001. COMPSAC 2001. 25th Annual International
Conference_Location
Chicago, IL
ISSN
0730-3157
Print_ISBN
0-7695-1372-7
Type
conf
DOI
10.1109/CMPSAC.2001.960654
Filename
960654
Link To Document