DocumentCode :
2663102
Title :
Hot-Spot Avoidance With Multi-Pathing Over InfiniBand: An MPI Perspective
Author :
Vishnu, A. ; Koop, M. ; Moody, A. ; Mamidala, A.R. ; Narravula, S. ; Panda, D.K.
Author_Institution :
Ohio State Univ., Columbus, OH
fYear :
2007
fDate :
14-17 May 2007
Firstpage :
479
Lastpage :
486
Abstract :
Large-scale InfiniBand clusters are becoming increasingly popular, as reflected by the TOP500 supercomputer rankings. At the same time, the fat tree has become a popular interconnection topology for these clusters, since it makes multiple paths available between a pair of nodes. However, even with a fat tree, hot-spots may occur in the network depending upon the route configuration between end nodes and the communication pattern(s) in the application. To make matters worse, the deterministic routing of InfiniBand prevents the application from transparently using multiple paths effectively to avoid hot-spots in the network. Simulation-based studies of congestion control implemented in switches and adapters have been proposed in the literature. However, these studies have focused on providing congestion control for the communication path, and not on utilizing multiple paths in the network for hot-spot avoidance. In this paper, we design an MPI functionality that provides hot-spot avoidance for different communication patterns, without a priori knowledge of the pattern. We leverage the LMC (LID Mask Control) mechanism of InfiniBand to create multiple paths in the network and present the design issues (scheduling policies, selecting the number of paths, scalability aspects) of our design. We implement our design and evaluate it with Pallas collective communication benchmarks and MPI applications. On an InfiniBand cluster with 48 processes, MPI All-to-all personalized communication shows an improvement of 27%. Our evaluation with the NAS Parallel Benchmarks on 64 processes shows significant improvement in execution time with this functionality.
Keywords :
message passing; telecommunication congestion control; telecommunication network routing; InfiniBand cluster; MPI functionality; congestion control; deterministic routing; multiple paths; Communication switching; Communication system control; Computer science; Large-scale systems; Network topology; Routing; Scalability; Sun; Supercomputers; Switches;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Cluster Computing and the Grid, 2007. CCGRID 2007. Seventh IEEE International Symposium on
Conference_Location :
Rio de Janeiro
Print_ISBN :
0-7695-2833-3
Type :
conf
DOI :
10.1109/CCGRID.2007.60
Filename :
4215414