• DocumentCode
    911156
  • Title

    The MAFT architecture for distributed fault tolerance

  • Author

    Keichafer, R.M. ; Walter, Chris J. ; Finn, Alan M. ; Thambidurai, Philip M.

  • Author_Institution
    Allied Signal Corp., Columbia, MD, USA
  • Volume
    37
  • Issue
    4
  • fYear
    1988
  • fDate
    4/1/1988 12:00:00 AM
  • Firstpage
    398
  • Lastpage
    404
  • Abstract
    A description is given of the multicomputer architecture for fault tolerance (MAFT), a distributed system designed to provide extremely reliable computation in real-time control systems. MAFT is based on the physical and functional partitioning of executive functions from applications functions. The implementation of the executive functions in a special-purpose hardware processor allows the fault-tolerance functions to be transparent to the application programs and minimizes overhead. Byzantine agreement and approximate agreement algorithms are used for critical system parameters. MAFT supports the use of multiversion hardware and software to tolerate built-in or generic faults. Graceful degradation and restoration of the application workload is permitted in response to the exclusion and readmission of nodes, respectively
  • Keywords
    computer architecture; distributed processing; fault tolerant computing; Byzantine agreement; MAFT architecture; application programs; approximate agreement algorithms; distributed fault tolerance; functional partitioning; multicomputer architecture for fault tolerance; real-time control systems; special-purpose hardware processor; Application software; Computer architecture; Control systems; Distributed computing; Embedded software; Fault tolerance; Fault tolerant systems; Hardware; Partitioning algorithms; Real time systems;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/12.2183
  • Filename
    2183