• DocumentCode
    1114900
  • Title

    An Adaptive Programming Model for Fault-Tolerant Distributed Computing

  • Author

    Gorender, Sérgio ; De Araújo Macédo, Raimundo José ; Raynal, Michel

  • Author_Institution
    Dept. of Comput. Sci., Bahia Fed. Univ.
  • Volume
    4
  • Issue
    1
  • fYear
    2007
  • Firstpage
    18
  • Lastpage
    31
  • Abstract
    The capability of dynamically adapting to distinct runtime conditions is an important issue when designing distributed systems where negotiated quality of service (QoS) cannot always be delivered between processes. Providing fault tolerance for such dynamic environments is a challenging task. Considering such a context, this paper proposes an adaptive programming model for fault-tolerant distributed computing, which provides upper-layer applications with process state information according to the current system synchrony (or QoS). The underlying system model is hybrid, composed by a synchronous part (where there are time bounds on processing speed and message delay) and an asynchronous part (where there is no time bound). However, such a composition can vary over time, and, in particular, the system may become totally asynchronous (e.g., when the underlying system QoS degrade) or totally synchronous. Moreover, processes are not required to share the same view of the system synchrony at a given time. To illustrate what can be done in this programming model and how to use it, the consensus problem is taken as a benchmark problem. This paper also presents an implementation of the model that relies on a negotiated quality of service (QoS) for communication channels
  • Keywords
    concurrency theory; distributed programming; quality of service; software fault tolerance; adaptive programming model; benchmark problem; communication channel; consensus problem; distributed system; fault tolerance; fault-tolerant distributed computing; hybrid system model; negotiated quality of service; runtime condition; state information; system synchrony; upper-layer application; Broadcasting; Computer crashes; Delay effects; Distributed computing; Dynamic programming; Fault tolerance; Fault tolerant systems; Protocols; Quality of service; Runtime; Adaptability; asynchronous/synchronous distributed system; consensus; distributed computing model; fault tolerance; quality of service.;
  • fLanguage
    English
  • Journal_Title
    Dependable and Secure Computing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5971
  • Type

    jour

  • DOI
    10.1109/TDSC.2007.3
  • Filename
    4099189