Title :
A low cost fault tolerant packet routing for parallel computers
Author :
Puente, V. ; Gregorio, J.A. ; Beivide, R. ; Vallejo, F.
Author_Institution :
Comput. Archit. Group, Cantabria Univ., Santander, Spain
Abstract :
This paper presents a new switching mechanism to tolerate arbitrary faults in interconnection networks with a negligible implementation cost. Although our routing technique can be applied to any regular or irregular topology, in this paper we focus on its application to k-ary n-cube networks when managing both synthetic and real traffic workloads. Our mechanism is effective regardless of the number of faults and their configuration. When the network is working without any fault, no overhead is added to the original routing scheme. In the presence of a low number of faults, the network sustains a performance close to that observed under fault-free conditions. Finally, when the number of faults increases, the system exhibits a graceful performance degradation.
Keywords :
hypercube networks; parallel processing; performance evaluation; arbitrary faults; graceful performance degradation; k-ary n-cube networks; low cost fault tolerant packet routing; parallel computers; switching mechanism; Application software; Computer network management; Concurrent computing; Costs; Degradation; Fault tolerance; Multiprocessor interconnection networks; Network topology; Routing; Telecommunication traffic;
Conference_Title :
Parallel and Distributed Processing Symposium, 2003. Proceedings. International
Print_ISBN :
0-7695-1926-1
DOI :
10.1109/IPDPS.2003.1213132