• DocumentCode
    1686273
  • Title
    A generic log-service supporting fast recovery in distributed fault-tolerant systems

  • Author
    Wirz, B. ; Nett, E.

  • Author_Institution
    Gesellschaft fuer Mathematik und Datenverarbeitung, Sankt Augustin, Germany
  • fYear
    1993
  • fDate
    10/6/1993 12:00:00 AM
  • Firstpage
    121
  • Lastpage
    126
  • Abstract
    Logs are an important facility for fault-tolerant distributed systems, since they make it possible to reliably store the information needed to provide a globally consistent system state even in the presence of failures. The authors focus on the problem of fast recovery after a node crash. The approach is mainly based on minimizing the number of log records to be retrieved. This is achieved by periodically discarding obsolete information in a very efficient manner without affecting the normal logging procedure. The main idea is that the generic Log-Service provides a high-level interface to the application, which allows the Log-Service itself to interpret the semantics of log records without consulting the application at run time. In addition, the authors are able to reduce the overhead of analyzing the log contents during restart by scanning the log only once and only in the forward direction.
  • Keywords
    distributed processing; fault tolerant computing; finite state machines; system recovery; distributed fault-tolerant systems; distributed systems; fast recovery; log records; log-service; Automata; Computer crashes; Error correction; Fault tolerant systems; History; Protocols; Runtime;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Advances in Parallel and Distributed Systems, 1993., Proceedings of the IEEE Workshop on
  • Conference_Location
    Princeton, NJ
  • Print_ISBN
    0-8186-5250-0
  • Type
    conf
  • DOI
    10.1109/APADS.1993.588858
  • Filename
    588858