Title :
Diagnosis and repair in multiprocessor systems
Author :
Blough, Douglas M. ; Pelc, Andrzej
Author_Institution :
Dept. of Electr. & Comput. Eng., California Univ., Irvine, CA, USA
fDate :
2/1/1993 12:00:00 AM
Abstract :
Diagnosis of multiprocessor systems in which faulty processors can be replaced by spares or repaired is known as sequential diagnosis. A generalization is considered of classical sequential diagnosis, referred to as diagnosis and repair, under a probabilistic model for the faults and test outcomes in a system. It is shown that correct diagnosis and repair of all faulty processors can be achieved with high probability in a large class of systems including, for example, rings, grids, meshes, tori, and hypercubes. These results show, without restrictive assumptions on the behavior of faulty processors, that correct diagnosis can be achieved in these widely used, low-degree systems when a fixed percentage of the processors in the system are faulty
Keywords :
fault tolerant computing; multiprocessing systems; probability; diagnosis and repair; faulty processors; grids; hypercubes; meshes; multiprocessor systems; probabilistic model; rings; sequential diagnosis; tori; Councils; Fault diagnosis; Hypercubes; Multiprocessing systems; Performance evaluation; Sequential analysis; Sequential diagnosis; System testing; Topology;
Journal_Title :
Computers, IEEE Transactions on