Title :
Deadlock-Free Oblivious Routing for Arbitrary Topologies
Author :
Domke, Jens ; Hoefler, Torsten ; Nagel, Wolfgang E.
Author_Institution :
Center for Inf. Services & High Performance Comput., Tech. Univ. Dresden, Dresden, Germany
Abstract :
Efficient deadlock-free routing strategies are crucial to the performance of large-scale computing systems. There are many methods but it remains a challenge to achieve lowest latency and highest bandwidth for irregular or unstructured high-performance networks. % We investigate a novel routing strategy based on the single-source-shortest-path routing algorithm and extend it to use virtual channels to guarantee deadlock-freedom. We show that this algorithm achieves minimal latency and high bandwidth with only a low number of virtual channels and can be implemented in practice. % We demonstrate that the problem of finding the minimal number of virtual channels needed to route a general network deadlock-free is NP-complete and we propose different heuristics to solve the problem. We implement all proposed algorithms in the Open Subnet Manager of InfiniBand and compare the number of needed virtual channels and the bandwidths of multiple real and artificial network topologies which are established in practice. % Our approach allows to use the existing virtual channels more effectively to guarantee deadlock-freedom and increase the effective bandwidth of up to a factor of two. Application benchmarks show an improvement of up to 95%. Our routing scheme is not limited to InfiniBand but can be deployed on existing InfiniBand installations to increase network performance transparently without modifications to the user applications.
Keywords :
computational complexity; computer networks; graph theory; telecommunication network routing; telecommunication network topology; NP-complete; arbitrary topologies; artificial network topologies; deadlock-free oblivious routing; large-scale computing systems; single-source-shortest-path routing algorithm; unstructured high-performance networks; virtual channels; Bandwidth; Generators; Network topology; Polynomials; Routing; System recovery; Topology;
Conference_Titel :
Parallel & Distributed Processing Symposium (IPDPS), 2011 IEEE International
Conference_Location :
Anchorage, AK
Print_ISBN :
978-1-61284-372-8
Electronic_ISBN :
1530-2075
DOI :
10.1109/IPDPS.2011.65