DocumentCode :
3017916
Title :
Design and Implementation of Open MPI over Quadrics/Elan4
Author :
Yu, Weikuan ; Woodall, Tim S. ; Graham, Rich L. ; Panda, Dhabaleswar K.
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
fYear :
2005
fDate :
04-08 April 2005
Abstract :
Open MPI is a project recently initiated to provide a fault-tolerant, multi-network capable implementation of MPI-2, based on experiences gained from FT-MPI, LA-MPI, LAM/MPI, and MVAPICH projects. Its initial communication architecture is layered on top of TCP/IP. In this paper, we have designed and implemented Open MPI point-to-point layer on top of a high-end interconnect, Quadrics/Elan4. The restriction of Quadrics static process model has been overcome to accommodate the requirement of MPI-2 dynamic process management. Quadrics Queued-based Direct Memory Access (QDMA) and Remote Direct Memory Access (RDMA) mechanisms have been integrated to form a low-overhead, high-performance transport layer. Lightweight asynchronous progress is made possible with a combination of Quadrics chained event and QDMA mechanisms. Experimental results indicate that the resulting point-to-point transport layer is able to achieve comparable performance to Quadrics native QDMA operations, from which it is derived. Our implementation provides an MPI-2 compliant message passing library over Quadrics/Elan4 with a performance comparable to MPICH-Quadrics.
Keywords :
fault tolerant computing; message passing; open systems; transport protocols; MPI-2 dynamic process management; Quadrics queued-based direct memory access; Quadrics static process model; Quadrics/Elan4; TCP/IP; communication architecture; message passing library; open MPI; remote direct memory access; Bandwidth; Computer networks; Fault tolerance; Laboratories; Libraries; Message passing; National security; Protocols; TCPIP; US Department of Energy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium, 2005. Proceedings. 19th IEEE International
Print_ISBN :
0-7695-2312-9
Type :
conf
DOI :
10.1109/IPDPS.2005.163
Filename :
1419923
Link To Document :
بازگشت