• DocumentCode
    3064572
  • Title

    Availability study of dynamic voting algorithms

  • Author

    Ingols, Kyle ; Keidar, Idit

  • Author_Institution
    Lab. for Comput. Sci., MIT, Cambridge, MA, USA
  • fYear
    2001
  • fDate
    36982
  • Firstpage
    247
  • Lastpage
    254
  • Abstract
    Fault-tolerant distributed systems often select a primary component to allow a subset of the processes to function when failures occur. The dynamic voting paradigm defines rules for selecting the primary component adaptively: when a partition occurs, if a majority of the previous primary component is connected, a new and possibly smaller primary component is chosen. Several studies have shown that dynamic voting leads to more available solutions than other paradigms for maintaining a primary component. However, these studies have assumed that every attempt made by the algorithm to form a new primary component terminates successfully. Unfortunately, in real systems, this is not always the case: a change in connectivity can interrupt the algorithm while it is still attempting to form a new primary component; in such cases, algorithms may block until the processes can resolve the outcome of the interrupted attempt. This paper uses simulations to evaluate the effect of interruptions on the availability of dynamic voting algorithm. We study four dynamic voting algorithms and identify two important characteristics that impact an algorithm´s availability in runs with frequent connectivity changes. First, we show that the number of processes that need to be present in order to resolve past attempts impacts the availability, especially during long runs with numerous connectivity changes. Second, we show that the number of communication rounds exchanged in an algorithm plays a significant role in the availability achieved, especially in the degradation of availability as connectivity changes become more frequent
  • Keywords
    distributed algorithms; software fault tolerance; algorithm interruption; algorithm termination; availability; communication rounds; connectivity changes; dynamic voting algorithms; fault-tolerant distributed systems; partition; primary component selection; process failures; simulations; Aerodynamics; Computer science; Contracts; Degradation; Fault tolerant systems; Heuristic algorithms; Oils; Partitioning algorithms; Telegraphy; Voting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Distributed Computing Systems, 2001. 21st International Conference on.
  • Conference_Location
    Mesa, AZ
  • Print_ISBN
    0-7695-1077-9
  • Type

    conf

  • DOI
    10.1109/ICDSC.2001.918954
  • Filename
    918954