• DocumentCode
    1996548
  • Title

    Chasing the FLP impossibility result in a LAN: or, How robust can a fault tolerant server be?

  • Author

    Urbán, Péter ; Défago, Xavier ; Schiper, André

  • Author_Institution
    Ecole Polytech. Fed. de Lausanne, Switzerland
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    190
  • Lastpage
    193
  • Abstract
    Fault tolerance can be achieved in distributed systems by replication. However Fischer, Lynch and Paterson (1985) have proven an impossibility result about consensus in the asynchronous system model, and similar impossibility results exist for atomic broadcast and group membership. We investigate, with the aid of an experiment conducted in a LAN, whether these impossibility results set limits to the robustness of a replicated server exposed to extremely high loads. The experiment consists of client processes that send requests to a replicated server (three replicas) using an atomic broadcast primitive. It has parameters that allow us to control the load on the hosts and the network, as well as the timeout value used by our heartbeat failure detection mechanism. Our main observation is that the atomic broadcast algorithm never stops delivering messages, not even under arbitrarily high load and very small timeout values (1 ms). So, by trying to illustrate the practical impact of impossibility results, we discovered that we had implemented a very robust replicated service
  • Keywords
    client-server systems; fault tolerant computing; local area networks; network servers; replicated databases; FLP impossibility result; LAN; asynchronous system model; atomic broadcast; distributed systems; fault tolerant server; group membership; heartbeat failure detection mechanism; impossibility result; replication; timeout value; Broadcasting; Detectors; Fault tolerance; Fault tolerant systems; Frequency synchronization; Heart beat; Local area networks; Network servers; Protocols; Robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Reliable Distributed Systems, 2001. Proceedings. 20th IEEE Symposium on
  • Conference_Location
    New Orleans, LA
  • ISSN
    1060-9857
  • Print_ISBN
    0-7695-1366-2
  • Type

    conf

  • DOI
    10.1109/RELDIS.2001.970767
  • Filename
    970767