DocumentCode
3203899
Title
Fault-tolerant multiprocessor system routing using incomplete diagnostic information
Author
Blough, Douglas M. ; Najand, Shahriar
Author_Institution
Dept. of Electr. & Comput. Eng., California Univ., Irvine, CA, USA
fYear
1992
fDate
23-26 Mar 1992
Firstpage
398
Lastpage
402
Abstract
Fault-tolerant routing algorithms in multiprocessor systems utilize diagnostic information in selecting paths for messages. In many situations, only incomplete, or partial, diagnostic information is available for this purpose. The authors present algorithms for achieving two forms of diagnosis, known as k -reachability diagnosis and k -neighborhood diagnosis which provide partial diagnostic information. They compare, both analytically and through experiments conducted on an Intel iPSC/2 hypercube the performance and overhead of these two algorithms. They also present a routing algorithm that successfully routes messages between connected non-faulty nodes in systems of arbitrary topology containing an arbitrary number of faults. The performance of the algorithm is shown to be optimal when k = n -1 and within a factor of two of optimal, in the worst case, when k =1
Keywords
fault tolerant computing; hypercube networks; network routing; parallel algorithms; performance evaluation; Intel iPSC/2 hypercube; algorithm performance; connected non-faulty nodes; fault tolerant routing algorithms; incomplete diagnostic information; k-neighborhood diagnosis; k-reachability diagnosis; message path selection; message routing; multiprocessor systems; partial diagnostic information; Algorithm design and analysis; Fault diagnosis; Fault tolerance; Fault tolerant systems; Hypercubes; Large-scale systems; Multiprocessing systems; Performance analysis; Routing; System testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing Symposium, 1992. Proceedings., Sixth International
Conference_Location
Beverly Hills, CA
Print_ISBN
0-8186-2672-0
Type
conf
DOI
10.1109/IPPS.1992.223013
Filename
223013
Link To Document