• DocumentCode
    1422754
  • Title

    Fault-tolerant computing: fundamental concepts

  • Author

    Nelson, Victor P.

  • Author_Institution
    Auburn Univ., AL, USA
  • Volume
    23
  • Issue
    7
  • fYear
    1990
  • fDate
    7/1/1990 12:00:00 AM
  • Firstpage
    19
  • Lastpage
    25
  • Abstract
    The basic concepts of fault-tolerant computing are reviewed, focusing on hardware. Failures, faults, and errors in digital systems are examined, and measures of dependability, which dictate and evaluate fault-tolerance strategies for different classes of applications, are defined. The elements of fault-tolerance strategies are identified, and various strategies are reviewed. They are: error detection, masking, and correction; error detection and correction codes; self-checking logic; module replication for error detection and masking; protocol and timing checks; fault containment; reconfiguration and repair; and system recovery.<>
  • Keywords
    fault tolerant computing; correction; digital systems; error detection; errors; fault containment; fault-tolerant computing; hardware; masking; module replication; protocol; reconfiguration; repair; self-checking logic; system recovery; timing checks; Digital systems; Error correction codes; Fault detection; Fault diagnosis; Fault tolerance; Fault tolerant systems; Hardware; Protocols; Reconfigurable logic; Timing;
  • fLanguage
    English
  • Journal_Title
    Computer
  • Publisher
    ieee
  • ISSN
    0018-9162
  • Type

    jour

  • DOI
    10.1109/2.56849
  • Filename
    56849