• DocumentCode
    2484766
  • Title
    An evaluative study on the effect of contention on message latencies in large supercomputers

  • Author
    Bhatelé, Abhinav ; Kale, Laxmikant V.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
  • fYear
    2009
  • fDate
    23-29 May 2009
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Significant theoretical research was done on interconnect topologies and topology-aware mapping for parallel computers in the 1980s. With the deployment of virtual cut-through, wormhole routing and faster interconnects, message latencies decreased and research in the area died down. This paper presents a study showing that with the emergence of very large supercomputers, typically connected as a 3D torus or mesh, topology effects have become important again. It presents an evaluative study on the effect of contention on message latencies on torus and mesh networks. The paper uses three MPI benchmarks to evaluate the effect of the number of hops (links) traversed by messages on their latencies. The benchmarks demonstrate that when multiple messages compete for network resources, link occupancy or contention can increase message latencies by up to a factor of 8. In other words, contention leads to increased message latencies and reduces the effective bandwidth available to each message. This suggests that application developers should consider interconnect topologies when mapping tasks to processors in order to obtain the best performance. Results are shown for two parallel machines: ANL's Blue Gene/P and PSC's XT3.
  • Keywords
    application program interfaces; interconnections; message passing; network topology; parallel machines; performance evaluation; 3D torus networks; ANL; Blue Gene/P; MPI benchmarks; PSC; XT3; interconnect topologies; large supercomputers; link contention; link occupancy; mesh networks; message latency; network resources; parallel computers; parallel machines; topology aware mapping; topology effects; virtual cut-through; wormhole routing; Bandwidth; Computer science; Concurrent computing; Delay; Equations; Mesh networks; Network topology; Parallel machines; Routing; Supercomputers;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on
  • Conference_Location
    Rome
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4244-3751-1
  • Electronic_ISBN
    1530-2075
  • Type
    conf
  • DOI
    10.1109/IPDPS.2009.5161094
  • Filename
    5161094