DocumentCode
2176858
Title
Impact of fault management server and its failure-related parameters on high-availability communication systems
Author
Sun, Hairong ; Han, James J. ; Levendel, Isaac
fYear
2002
fDate
2002
Firstpage
679
Lastpage
686
Abstract
In this paper, we investigate the impact of a fault management server and its failure-related parameters on high-availability communication systems. The key point is that, to achieve high overall availability of a communication system, the availability of the fault management server itself is not as important as its fail-safe ratio and fault coverage. In other words, in building fault management servers, more attention should be paid to improving the server´s ability of detecting faults in functional units and its own isolation under failure from the functional units. Tradeoffs can be made between the availability of the fault management server, the fail-safe ratio and the fault coverage ratio to optimize system availability. A cost-effective design for the fault management server is proposed in this paper.
Keywords
computer communications software; file servers; software fault tolerance; fail-safe ratio; failure-related parameters; fault management server; fault-tolerant computers; high-availability communication systems; logic self-checking; Availability; Communication system software; Computer architecture; Computer network management; Fault detection; Memory management; Network servers; Power system management; Sun; Traffic control;
fLanguage
English
Publisher
ieee
Conference_Titel
Dependable Systems and Networks, 2002. DSN 2002. Proceedings. International Conference on
Print_ISBN
0-7695-1101-5
Type
conf
DOI
10.1109/DSN.2002.1029013
Filename
1029013
Link To Document