• DocumentCode
    970113
  • Title

    Resilient Distributed Computing

  • Author

    Svobodova, Liba

  • Author_Institution
    Institut National de Recherche en Informatique et an Automatique, Rocquencourt, France.; IBM Zÿrich Research Laboratory, 8803 Rÿschlikon, Switzerland.
  • Issue
    3
  • fYear
    1984
  • fDate
    5/1/1984 12:00:00 AM
  • Firstpage
    257
  • Lastpage
    268
  • Abstract
    A control abstraction called atomic action is a powerful general mechanism for ensuring consistent behavior of a system in spite of failures of individual computations running in the system, and in spite of system crashes. However, because of the ``all-or-nothing´´ property of atomic actions, an important amount of work might be abandoned needlessly when an internal error is encountered. This paper discusses how implementation of resilient distributed systems can be supported using a combination of nested atomic actions and stable checkpoints. Nested atomic actions form a tree structure. When an internal atomic action terminates, its results are not made permanent until the outermost atomic action commits, but they survive local node failures. Each subtree of atomic actions is recoverable individually. A checkpoint is established in stable storage as part of a remote request so that results of such a request can be reclaimed if the requesting node fails in the meantime, The paper shows how remote procedure call primitives with ``at-most-once´´ semantics and recovery blocks can be built with these mechanisms.
  • Keywords
    Computer crashes; Distributed computing; Distributed processing; Fault tolerant systems; Hardware; Helium; Permission; Power system modeling; Tree data structures; Uncertainty; Atomic actions; checkpoints; consistency; crash resistance; distributed programs; distributed systems; recoverability; remote procedure calls;
  • fLanguage
    English
  • Journal_Title
    Software Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0098-5589
  • Type

    jour

  • DOI
    10.1109/TSE.1984.5010234
  • Filename
    5010234