Title :
Efficient RDMA-based multi-port collectives on multi-rail QsNetII clusters
Author :
Qian, Ying ; Afsahi, Ahmad
Author_Institution :
Dept. of Electr. & Comput. Eng., Queen's Univ.
Abstract :
Many scientific applications use MPI collective communications intensively. Therefore, efficient and scalable implementations of collective operations are critical to the performance of such applications running on clusters. Quadrics QsNetII is a high-performance interconnect for clusters that implements some collectives at the Elan level. These collectives are used directly by their corresponding MPI collectives. Quadrics software supports point-to-point striping over multi-rail QsNetII networks. However, multi-rail collectives have not been supported. In this work, we propose a number of RDMA-based multi-port collectives over multi-rail QsNetII clusters directly at the Elan level. Our performance results indicate that the proposed multi-port gather achieves an improvement of up to 6.35 over the native elan_gather for 1 MB messages. The proposed multi-port all-to-all outperforms the native elan_alltoall by a factor of 2.19 for 16 KB messages. We have also proposed two algorithms for the scatter operation.
Keywords :
message passing; workstation clusters; Elan level; message passing interface collectives; multiport all-to-all; multirail QsNetII clusters; multirail QsNetII networks; native elan_alltoall; native elan_gather; point-to-point striping; Quadrics QsNetII; Quadrics software; remote direct memory access-based multiport collectives; Application software; Clustering algorithms; Communication system software; Computer networks; Libraries; Multiprocessor interconnection networks; Performance gain; Rails; Scalability; Scattering
Conference_Title :
Parallel and Distributed Processing Symposium, 2006. IPDPS 2006. 20th International
Conference_Location :
Rhodes Island
Print_ISBN :
1-4244-0054-6
DOI :
10.1109/IPDPS.2006.1639563