DocumentCode
299712
Title
Configurable flow control mechanisms for fault-tolerant routing
Author
Dao, Binh Vien ; Duato, Jose ; Yalamanchili, Sudhakar
Author_Institution
Comput. Syst. Res. Lab., Georgia Inst. of Technol., Atlanta, GA, USA
fYear
1995
fDate
22-24 June 1995
Firstpage
220
Lastpage
229
Abstract
Fault-tolerant routing protocols in modern interconnection networks rely heavily on the network flow control mechanisms used. Optimistic flow control mechanisms such as wormhole routing (WR) realize very good performance, but are prone to deadlock in the presence of faults. Conservative flow control mechanisms such as pipelined circuit switching (PCS) insures existence of a path to the destination prior to message transmission, but incurs increased overhead. Existing fault-tolerant routing protocols are designed with one or the other, and must accommodate their associated constraints. This paper proposes the use of configurable flow control mechanisms. Routing protocols can then be designed such that in the vicinity of faults, protocols use a more conservative flow control mechanism, while the majority of messages that traverse fault-free portions of the network utilize a WR like flow control to maximize performance. Such protocols are referred to as two-phase protocols where routing decisions are provided some control over the operation of the virtual channels. This ability provides new avenues for optimizing message passing performance in the presence of faults. A fully adaptive two-phase protocol is proposed and compared via simulation to those based on WR and PCS. The architecture of a network router supporting configurable flow control is described, and the paper concludes with avenues for future research.
Keywords
fault tolerant computing; message passing; multiprocessor interconnection networks; protocols; configurable flow control mechanisms; fault-free portions; fault-tolerant routing; interconnection networks; message passing performance; message transmission; pipelined circuit switching; protocols; wormhole routing; Circuit faults; Computer networks; Delay; Fault tolerance; Permission; Personal communication networks; Protocols; Routing; System recovery; Throughput;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Architecture, 1995. Proceedings., 22nd Annual International Symposium on
Conference_Location
Santa Margherita Ligure, Italy
ISSN
1063-6897
Print_ISBN
0-89791-698-0
Type
conf
Filename
524563
Link To Document