DocumentCode :
1705353
Title :
Fault-tolerance with multimodule routers
Author :
Chalasani, Suresh ; Boppana, Rajendra V.
Author_Institution :
Dept. of Electr. & Comput. Eng., Wisconsin Univ., Madison, WI, USA
fYear :
1996
Firstpage :
201
Lastpage :
210
Abstract :
Current multiprocessors such as the Cray T3D support interprocessor communication using partitioned dimension-order routers (PDRs). In a PDR implementation, the routing logic and switching hardware are partitioned into multiple modules, with each module suitable for implementation as a chip. This paper proposes a method to incorporate fault tolerance into such routers with simple changes to the router structure and logic. Previously known fault-tolerant routing methods assume centralized crossbar-based routers and are not applicable to multiprocessors with PDRs. The proposed technique works for the convex fault model, using only local knowledge of faults. Using the proposed techniques and as few as four virtual channels per physical channel, torus networks with PDRs can handle faults without compromising deadlock- and livelock-freedom. Simulations for 2-dimensional torus and mesh networks show that the resulting fault-tolerant PDRs perform similarly to crossbar-based routers.
Keywords :
fault tolerant computing; multiprocessor interconnection networks; network routing; 2-dimensional torus; Cray T3D; fault-tolerance; fault-tolerant PDRs; fault-tolerant routing; interprocessor communication; mesh networks; multimodule routers; multiprocessors; partitioned dimension-order routers; routing logic; switching hardware; Communication switching; Computer science; Fault tolerance; Hardware; Logic; Mesh networks; Network topology; Pins; Routing; System recovery;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
High-Performance Computer Architecture, 1996. Proceedings., Second International Symposium on
Conference_Location :
San Jose, CA
Print_ISBN :
0-8186-7237-4
Type :
conf
DOI :
10.1109/HPCA.1996.501186
Filename :
501186