Title :
The analysis and optimization of collective communications on a Beowulf cluster
Author :
Tan, Wi Bing ; Strazdins, Peter
Author_Institution :
Dept. of Comput. Sci., Australian Nat. Univ., Canberra, ACT, Australia
Abstract :
This paper gives a performance analysis of the all-gather, all-reduce and reduce-scatter collective communication operations on a Beowulf cluster. This cluster has a contention-free switch-based network with multiple network interface cards per node, permitting overlapping of message transmission under certain circumstances. As well as considering traditional algorithms developed previously for parallel computers with vendor-specific networks, we also examine simpler algorithms made up of repeated sub-operations, such as broadcasts. We find that for the kind of network on the Beowulf cluster, a somewhat different performance modelling of the algorithms is required, and that some simple simulation tools had to be developed in order to fully understand some of the algorithms' performance. Our results indicate that the LAM MPI implementations for these operations may be significantly improved, and the algorithms with data exchange and potential contention perform well on the cluster. Furthermore, they indicate that algorithms permitting message overlap are slightly favoured, with a new and simple algorithm which modestly out-performs the best traditional algorithms in the case of reduce-scatter. With the exception that the degree of overlapping proved difficult to estimate, our performance models fitted closely with the results, and together with the simulation tools, permit a detailed understanding of the cluster's communication pattern performance.
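Illustrative note (not part of the original record): the abstract names three collective operations and mentions algorithms built from repeated sub-operations such as broadcasts. The following is a minimal sketch of what those operations compute, simulated in a single Python process with lists standing in for per-process buffers. All function names here are hypothetical, summation is assumed as the reduction operator, and the sketch says nothing about the paper's actual algorithms, message overlap, or performance.

```python
from typing import List


def broadcast(buffers: List[list], root: int, slot: int) -> None:
    """Copy the root's value in `slot` into the same slot on every process."""
    value = buffers[root][slot]
    for rank in range(len(buffers)):
        buffers[rank][slot] = value


def all_gather_by_broadcasts(contributions: list) -> List[list]:
    """All-gather built from p broadcasts, one rooted at each process.

    Process i starts with contributions[i]; afterwards every process
    holds the full list of contributions.
    """
    p = len(contributions)
    buffers = [[None] * p for _ in range(p)]
    for rank in range(p):
        buffers[rank][rank] = contributions[rank]
    for root in range(p):
        broadcast(buffers, root, slot=root)
    return buffers


def reduce_scatter_sum(vectors: List[list]) -> list:
    """Reduce-scatter semantics (assumed operator: sum).

    Each of the p processes supplies a vector of length p; process i
    ends up with the sum of element i over all processes.
    """
    p = len(vectors)
    return [sum(vectors[src][i] for src in range(p)) for i in range(p)]


def all_reduce_sum(vectors: List[list]) -> List[list]:
    """All-reduce expressed as a reduce-scatter followed by an all-gather."""
    return all_gather_by_broadcasts(reduce_scatter_sum(vectors))
```

For example, under these assumptions `all_reduce_sum([[1, 2], [3, 4]])` leaves both simulated processes holding the element-wise sum of the two input vectors.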
Keywords :
application program interfaces; message passing; performance evaluation; workstation clusters; Beowulf cluster; LAM MPI implementations; all-gather collective communication operations; all-reduce collective communication operations; broadcasts; communication pattern performance; contention-free switch-based network; data exchange; message overlap; message transmission; network interface cards; optimization; parallel computers; performance modelling; reduce-scatter collective communication operations; repeated suboperations; simulation tools; Broadcasting; Clustering algorithms; Communication networks; Communication switching; Computer networks; Computer science; Concurrent computing; Network interfaces; Pattern analysis; Performance analysis;
Conference_Title :
Parallel and Distributed Systems, 2002. Proceedings. Ninth International Conference on
Print_ISBN :
0-7695-1760-9
DOI :
10.1109/ICPADS.2002.1183497