DocumentCode
2364261
Title
Fault-tolerant routing in 2D tori or meshes using limited-global-safety information
Author
Xiang, Dong ; Chen, Ai
Author_Institution
Inst. of Microelectron., Tsinghua Univ., Beijing, China
fYear
2002
fDate
2002
Firstpage
231
Lastpage
238
Abstract
A limited-global-safety-information-based metric called local safety is proposed to handle fault-tolerant routing in 2D tori (or meshes). Sufficient conditions for the existence of a minimum feasible path between the source and destination are presented, based on local safety information in a 2D torus network. An efficient heuristic function is defined to guide fault-tolerant routing inside a 2D torus network. Unlike conventional methods based on the block fault model, our method does not disable any fault-free nodes, and fault-free nodes inside a fault block can still be a source or a destination, which can greatly increase the throughput and computational power of the system. Techniques for deadlock avoidance are introduced. Extensive simulation results are presented.
Keywords
concurrency control; fault tolerant computing; heuristic programming; multiprocessing systems; 2D torus network; computational power; deadlock avoidance; efficient heuristic function; fault-tolerant routing; limited-global-safety information; meshes; Circuit faults; Fault tolerance; Hypercubes; Microelectronics; Power system modeling; Routing; Safety; Sufficient conditions; System recovery; Throughput;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing, 2002. Proceedings. International Conference on
ISSN
0190-3918
Print_ISBN
0-7695-1677-7
Type
conf
DOI
10.1109/ICPP.2002.1040878
Filename
1040878