• DocumentCode
    2298686
  • Title

    On the design and implementation of broadcast and global combine operations using the postal model

  • Author

    Bruck, Jehoshua ; de Coster, L. ; Dewulf, Natalie ; Ho, Ching-Tien ; Lauwereins, Rudy

  • Author_Institution
    IBM Almaden Res. Center, San Jose, CA, USA
  • fYear
    1994
  • fDate
    26-29 Oct 1994
  • Firstpage
    594
  • Lastpage
    602
  • Abstract
    Two models for message passing parallel systems are the postal model and its generalization, the LogP model. In the postal model a parameter λ is used to model the communication latency of the message-passing system. Each node during each round can send a fixed-size message and simultaneously, receive a message of the same size. Furthermore, a message sent out during round r will incur a latency of λ and will arrive at the receiving node at round r+λ-1. The goal of the article is to bridge the gap between the theoretical modeling and the practical implementation. In particular we investigate a number of practical issues related to the design and implementation of two collective communication operations, namely, the broadcast operation, and the global combine operation. Those practical issues include, for example: techniques for measurement of the value of λ on a given machine; creating efficient broadcast algorithms that get the latency λ and the number of nodes n as parameters; and creating efficient global combine algorithms for parallel machines with λ which is not an integer. We propose solutions that address those practical issues and present results of an experimental study of the new algorithms on the Intel Delta machine. The main conclusion is that the postal model can help in performance prediction and tuning, for example, a properly tuned broadcast improves the known implementation by more than 20%
  • Keywords
    broadcasting; communication complexity; message passing; parallel algorithms; parallel machines; performance evaluation; Intel Delta machine; LogP model; broadcast algorithms; collective communication operations; communication latency; fixed-size message; global combine operations; latency; message passing parallel systems; parallel machines; performance prediction; postal model; theoretical modeling; Bridges; Broadcasting; Delay; Libraries; Message passing; Parallel machines; Postal services; Power system modeling; Predictive models; User interfaces;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing, 1994. Proceedings. Sixth IEEE Symposium on
  • Conference_Location
    Dallas, TX
  • Print_ISBN
    0-8186-6427-4
  • Type

    conf

  • DOI
    10.1109/SPDP.1994.346118
  • Filename
    346118