DocumentCode
3047879
Title
Efficient distributed diagnosis in the presence of random faults
Author
Pelc, Andrzej
Author_Institution
Quebec Univ., Hull, Que., Canada
fYear
1993
fDate
22-24 June 1993
Firstpage
462
Lastpage
469
Abstract
A probabilistic setting for distributed system-level diagnosis is studied. Processors in a multiprocessor system fail independently with constant probability 0 < p < 1/2. They can test their neighbors and a fault-free processor has probability 0 < q < 1 of detecting a fault of a failed processor in a single test. The aim of distributed diagnosis is for every fault-free processor to know the status of every other processor. This is achieved by communicating test results throughout the system. Faulty processors can behave arbitrarily, even maliciously, both as testers and as message transmitters. The author proposes a diagnosis scheme using O(n) time, O(n log n) tests, O(n/sup 2/) message bits and working correctly with probability converging to 1, as n grows. It is shown that each of these three characteristics has the lowest possible order of magnitude.
Keywords
multiprocessing systems; distributed diagnosis; distributed system-level diagnosis; fault-free processor; message transmitters; multiprocessor system; probabilistic setting; random faults; Condition monitoring; Councils; Fault detection; Fault diagnosis; Multiprocessing systems; Performance evaluation; Polynomials; System testing; Transmitters; Upper bound;
fLanguage
English
Publisher
ieee
Conference_Titel
Fault-Tolerant Computing, 1993. FTCS-23. Digest of Papers., The Twenty-Third International Symposium on
Conference_Location
Toulouse, France
ISSN
0731-3071
Print_ISBN
0-8186-3680-7
Type
conf
DOI
10.1109/FTCS.1993.627349
Filename
627349
Link To Document