• DocumentCode
    3047879
  • Title

    Efficient distributed diagnosis in the presence of random faults

  • Author

    Pelc, Andrzej

  • Author_Institution
    Quebec Univ., Hull, Que., Canada
  • fYear
    1993
  • fDate
    22-24 June 1993
  • Firstpage
    462
  • Lastpage
    469
  • Abstract
    A probabilistic setting for distributed system-level diagnosis is studied. Processors in a multiprocessor system fail independently with constant probability 0 < p < 1/2. They can test their neighbors and a fault-free processor has probability 0 < q < 1 of detecting a fault of a failed processor in a single test. The aim of distributed diagnosis is for every fault-free processor to know the status of every other processor. This is achieved by communicating test results throughout the system. Faulty processors can behave arbitrarily, even maliciously, both as testers and as message transmitters. The author proposes a diagnosis scheme using O(n) time, O(n log n) tests, O(n/sup 2/) message bits and working correctly with probability converging to 1, as n grows. It is shown that each of these three characteristics has the lowest possible order of magnitude.
  • Keywords
    multiprocessing systems; distributed diagnosis; distributed system-level diagnosis; fault-free processor; message transmitters; multiprocessor system; probabilistic setting; random faults; Condition monitoring; Councils; Fault detection; Fault diagnosis; Multiprocessing systems; Performance evaluation; Polynomials; System testing; Transmitters; Upper bound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fault-Tolerant Computing, 1993. FTCS-23. Digest of Papers., The Twenty-Third International Symposium on
  • Conference_Location
    Toulouse, France
  • ISSN
    0731-3071
  • Print_ISBN
    0-8186-3680-7
  • Type

    conf

  • DOI
    10.1109/FTCS.1993.627349
  • Filename
    627349