• DocumentCode
    2549189
  • Title

    Host Side Dynamic Reconfiguration with InfiniBand

  • Author

    Guay, Wei Lin ; Reinemo, Sven-Arne ; Lysne, Olav ; Skeie, Tor ; Johnsen, Bjørn Dag ; Holen, Line

  • Author_Institution
    Simula Res. Lab., Lysaker, Norway
  • fYear
    2010
  • fDate
    20-24 Sept. 2010
  • Firstpage
    126
  • Lastpage
    135
  • Abstract
    Rerouting around faulty components and migration of jobs both require reconfiguration of data structures in the Queue Pairs residing in the hosts on an InfiniBand cluster. In this paper we report an implementation of dynamic reconfiguration of such host side data-structures. Our implementation preserves the Queue Pairs, and lets the application run without being interrupted. With this implementation, we demonstrate a complete solution to fault tolerance in an InfiniBand network, where dynamic network reconfiguration to a topology-agnostic routing function is used to avoid malfunctioning components. This solution is in principle able to let applications run uninterruptedly on the cluster, as long as the topology is physically connected. Through measurements on our test-cluster we show that the increased cost of our method in setup latency is negligible, and that there is only a minor reduction in throughput during reconfiguration.
  • Keywords
    computer networks; telecommunication network routing; InfiniBand cluster; host side data-structures; host side dynamic reconfiguration; queue pairs; topology-agnostic routing function; Fault tolerance; Fault tolerant systems; Network topology; Proposals; Routing; System recovery; Topology; Automatic path migration; Dynamic reconfiguration; InfiniBand; fault tolerance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing (CLUSTER), 2010 IEEE International Conference on
  • Conference_Location
    Heraklion, Crete
  • Print_ISBN
    978-1-4244-8373-0
  • Electronic_ISBN
    978-0-7695-4220-1
  • Type

    conf

  • DOI
    10.1109/CLUSTER.2010.21
  • Filename
    5600315