DocumentCode :
1085216
Title :
Routing in modular fault-tolerant multiprocessor systems
Author :
Alam, M. Sultan ; Melhem, Rami G.
Author_Institution :
AT&T Bell Labs., Redhill, NJ, USA
Volume :
6
Issue :
11
fYear :
1995
fDate :
11/1/1995 12:00:00 AM
Firstpage :
1206
Lastpage :
1220
Abstract :
In this paper, we consider a class of modular multiprocessor architectures in which spares are added to each module to cover for faulty nodes within that module, thus forming a fault-tolerant basic block (FTBB). In contrast to reconfiguration techniques that preserve the physical adjacency between active nodes in the system, our goal is to preserve the logical adjacency between active nodes by means of a routing algorithm which delivers messages successfully to their destinations. We introduce two-phase routing strategies that route messages first to their destination FTBB, and then to the destination nodes within the destination FTBB. Such a strategy may be applied to a variety of architectures including binary hypercubes and three-dimensional tori. In the presence of f faults in hypercubes and tori, we show that the worst case length of the message route is min {σ+f, (K+1)σ}+c where σ is the shortest path in the absence of faults, K is the number of spare nodes in an FTBB, and c is a small constant. The average routing overhead is much lower than the worst case overhead
Keywords :
fault tolerant computing; multiprocessing systems; multiprocessor interconnection networks; network routing; parallel architectures; average routing overhead; fault-tolerant basic block; fault-tolerant multiprocessor systems; faulty nodes; hypercubes; logical adjacency; multiprocessor architectures; tori; worst case overhead; Computer Society; Computer architecture; Degradation; Fault tolerance; Fault tolerant systems; Hypercubes; Multiprocessing systems; Real time systems; Routing; Topology;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/71.476192
Filename :
476192
Link To Document :
بازگشت