• DocumentCode
    1551903
  • Title

    Experimental evaluation of the fault tolerance of an atomic multicast system

  • Author

    Arlat, Jean ; Aguera, Martine ; Crouzet, Yves ; Fabre, Jean-Charles ; Martins, Eliane ; Powell, David

  • Author_Institution
    LAAS-CNRS, Toulouse, France
  • Volume
    39
  • Issue
    4
  • fYear
    1990
  • fDate
    10/1/1990 12:00:00 AM
  • Firstpage
    455
  • Lastpage
    467
  • Abstract
    The authors present a study of the validation of a dependable local area network providing multipoint communication services based on an atomic multicast protocol. This protocol is implemented in specialized communication servers, that exhibit the fail-silent property, i.e. a kind of halt-on-failure behavior enforced by self-checking hardware. The tests that have been carried out utilize physical fault injection and have two objectives: (1) to estimate the coverage of the self-checking mechanisms of the communication servers, and (2) to test the properties that characterize the service provided by the atomic multicast protocol in the presence of faults. The testbed that has been developed to carry out the fault-injection experiments is described, and the major results are presented and analyzed. It is concluded that the fault-injection test sequence has evidenced the limited performance of the self-checking mechanisms implemented on the tested NAC (network attachment controller) and justified (especially for the main board) the need for the improved self-checking mechanisms implemented in an enhanced NAC architecture using duplicated circuitry
  • Keywords
    fault tolerant computing; local area networks; protocols; NAC; atomic multicast system; communication servers; dependable local area network; fail-silent property; fault tolerance; halt-on-failure behavior; multicast protocol; multipoint communication services; network attachment controller; physical fault injection; self-checking hardware; Automatic testing; Computer networks; Distributed computing; Fault tolerant systems; Hardware; Multicast protocols; Reliability theory; Software prototyping; System testing; Telecommunication network reliability;
  • fLanguage
    English
  • Journal_Title
    Reliability, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9529
  • Type

    jour

  • DOI
    10.1109/24.58723
  • Filename
    58723