• DocumentCode
    2368673
  • Title

    A probabilistic method for fault diagnosis of multiprocessor systems

  • Author

    Rangarajan, S. ; Fussell, D.

  • Author_Institution
    Dept. of Comput. Sci., Texas Univ., Austin, TX, USA
  • fYear
    1988
  • fDate
    27-30 June 1988
  • Firstpage
    278
  • Lastpage
    283
  • Abstract
    The authors present a system-level fault-diagnosis algorithm for identifying faulty and fault-free units in a homogeneous system of computing elements. The algorithm is based on a comparison approach where tasks are performed by the units and their outputs are compared among themselves. Unlike other approaches, the authors´ algorithm requires no global syndrome analysis and therefore can be performed in real time as a background task during system operation. The time required to perform the diagnosis is constant regardless of the number of units in the system. Like previous global syndrome-based approaches, the accuracy of the algorithm is remarkably high, since it uses information about individual comparison results which is lost when these results are summarized in a global syndrome.<>
  • Keywords
    fault location; fault tolerant computing; multiprocessing systems; fault free processor identification; faulty processor identification; multiprocessor systems; probabilistic method; real time; system-level fault-diagnosis algorithm; Algorithm design and analysis; Costs; Fault detection; Fault diagnosis; Multiprocessing systems; Performance analysis; Performance evaluation; Real time systems; Redundancy; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fault-Tolerant Computing, 1988. FTCS-18, Digest of Papers., Eighteenth International Symposium on
  • Conference_Location
    Tokyo, Japan
  • Print_ISBN
    0-8186-0867-6
  • Type

    conf

  • DOI
    10.1109/FTCS.1988.5332
  • Filename
    5332