• DocumentCode
    1727457
  • Title
    Availability requirement for fault management server
  • Author
    Han, James J.; Sun, Hairong; Levendel, Haim
  • Author_Institution
    High Availability & Reliability Technol. Center, Motorola, Elk Grove Village, IL, USA
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    468
  • Lastpage
    473
  • Abstract
    This paper examines the availability requirement for the fault management server in high-availability communication systems. We find that the fault management server does not itself need 99.999% availability to guarantee 99.999% system availability, provided that the fail-safe ratio (the probability that a failure of the fault management server does not bring the system down) and the fault coverage ratio (the probability that a failure in the system is detected and recovered by the fault management server) are sufficiently high. The availability of the fault management server, the fail-safe ratio, and the fault coverage ratio can be traded off against one another to optimize system availability. A cost-effective design for the fault management server is proposed.
  • Keywords
    software fault tolerance; system recovery; availability requirement; cost-effective design; fail-safe ratio; fault coverage ratio; fault management server; high-availability communication systems; system availability; Availability; Computer network management; Energy management; Fault detection; Memory management; Network servers; Power supplies; Power system management; Sun; Technology management
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Software and Applications Conference, 2001. COMPSAC 2001. 25th Annual International
  • Conference_Location
    Chicago, IL
  • ISSN
    0730-3157
  • Print_ISBN
    0-7695-1372-7
  • Type
    conf
  • DOI
    10.1109/CMPSAC.2001.960654
  • Filename
    960654