• DocumentCode
    2181326
  • Title

    Extending collective operations with application semantics for improving multi-cluster performance

  • Author

    Bongo, Lars Ailo ; Anshus, Otto ; Bjørndalen, John Markus ; Larsen, Tore

  • Author_Institution
    Dept. of Comput. Sci., Tromso Univ., Norway
  • fYear
    2004
  • fDate
    5-7 July 2004
  • Firstpage
    320
  • Lastpage
    327
  • Abstract
    We identify two ways of increasing the performance of allreduce-style collective operations in a multi-cluster with large WAN latencies: (i) hiding latency in system noise, and (ii) conditional-allreduce, where knowledge about the application is used to reduce the number of WAN messages. In our multi-cluster, system noise was not large enough to hide the WAN latency. However, the latency could be hidden using conditional-allreduce, since on many iterations only cluster-local values were needed, and many of the values needed from other clusters were prefetched. A speedup of 2.4 was achieved for a microbenchmark. Prefetching introduced a small overhead in the cluster with the slowest hosts.
  • Keywords
    application program interfaces; grid computing; message passing; performance evaluation; storage management; wide area networks; workstation clusters; MPI; WAN latency; WAN messages; allreduce-style collective operations; application knowledge; application semantics; cluster-local values; computational grids; conditional-allreduce; latency hiding; message passing interface; microbenchmark; multicluster performance; prefetching; system noise; Application software; Bandwidth; Broadcasting; Clustering algorithms; Delay; Grid computing; Iterative algorithms; Prefetching; Wide area networks; Yarn;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel and Distributed Computing, 2004. Third International Symposium on/Algorithms, Models and Tools for Parallel Computing on Heterogeneous Networks, 2004. Third International Workshop on
  • Print_ISBN
    0-7695-2210-6
  • Type

    conf

  • DOI
    10.1109/ISPDC.2004.24
  • Filename
    1372083