DocumentCode
2181326
Title
Extending collective operations with application semantics for improving multi-cluster performance
Author
Bongo, Lars Ailo ; Anshus, Otto ; Bjørndalen, John Markus ; Larsen, Tore
Author_Institution
Dept. of Comput. Sci., Tromso Univ., Norway
fYear
2004
fDate
5-7 July 2004
Firstpage
320
Lastpage
327
Abstract
We identify two ways of increasing the performance of allreduce-style of collective operations in a multi-cluster with large WAN latencies: (i) hiding latency in system noise, and (ii) conditional-allreduce where knowledge about the application is used to reduce the number of WAN messages. In our multicluster, system noise was not large enough to hide the WAN latency. But, the latency could be hidden using conditional-allreduce, since on many iterations only cluster-local values were needed, and many of the values needed from other clusters were prefetched. A speedup of 2.4 was achieved for a microbenchmark. Prefetching introduced a small overhead in the cluster with the slowest hosts.
Keywords
application program interfaces; grid computing; message passing; performance evaluation; storage management; wide area networks; workstation clusters; MPI; WAN latency; WAN messages; allreduce-style collective operations; application knowledge; application semantics; cluster-local values; computational grids; conditional-allreduce; latency hiding; message passing interface; microbenchmark; multicluster performance; prefetching; system noise; Application software; Bandwidth; Broadcasting; Clustering algorithms; Delay; Grid computing; Iterative algorithms; Prefetching; Wide area networks; Yarn;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Computing, 2004. Third International Symposium on/Algorithms, Models and Tools for Parallel Computing on Heterogeneous Networks, 2004. Third International Workshop on
Print_ISBN
0-7695-2210-6
Type
conf
DOI
10.1109/ISPDC.2004.24
Filename
1372083
Link To Document