DocumentCode :
3144144
Title :
ConnectX-2 CORE-Direct Enabled Asynchronous Broadcast Collective Communications
Author :
Venkata, Manjunath Gorentla ; Graham, Richard L. ; Ladd, Joshua S. ; Shamis, Pavel ; Rabinovitz, Ishai ; Filipov, Vasily ; Shainer, Gilad
Author_Institution :
Comput. Sci. & Math. Div., Oak Ridge Nat. Lab., Oak Ridge, TN, USA
fYear :
2011
fDate :
16-20 May 2011
Firstpage :
781
Lastpage :
787
Abstract :
This paper describes the design and implementation of InfiniBand (IB) CORE-Direct based blocking and nonblocking broadcast operations within the Cheetah collective operation framework. It describes a novel approach that fully offloads collective operations and employs only user-supplied buffers. For a 64 rank communicator, the latency of CORE-Direct based hierarchical algorithm is better than production grade Message Passing Interface (MPI) implementations, 150% better than the default Open MPI algorithm and 115% better than the shared memory optimized MVAPICH implementation for a one kilo-byte (KB) message, and for eight mega-bytes (MB) it is 48% and 64% better, respectively. Flat-topology broadcast achieves 99.9% overlap in a polling based communication-computation test, and 95.1% overlap for a wait based test, compared with 92.4% and 17.0%, respectively, for a similar Central Processing Unit (CPU) based implementation.
Keywords :
application program interfaces; message passing; CORE-Direct based hierarchical algorithm; Cheetah collective operation framework; ConnectX-2 CORE-Direct; InfiniBand CORE-Direct; asynchronous broadcast collective communication; blocking broadcast operation; flat-topology broadcast; message passing interface; nonblocking broadcast operation; polling based communication-computation test; rank communicator; user-supplied buffer; Algorithm design and analysis; Benchmark testing; Central Processing Unit; Libraries; Receivers; Scalability; Size measurement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 IEEE International Symposium on
Conference_Location :
Shanghai
ISSN :
1530-2075
Print_ISBN :
978-1-61284-425-1
Electronic_ISBN :
1530-2075
Type :
conf
DOI :
10.1109/IPDPS.2011.221
Filename :
6008920
Link To Document :
بازگشت