• DocumentCode
    1925234
  • Title

    A New End-to-End Flow-Control Mechanism for High Performance Computing Clusters

  • Author

    Prades, Javier ; Silla, Federico ; Duato, José ; Fröning, Holger ; Nüssle, Mondrian

  • Author_Institution
    Dept. d´´Inf. de Sist. i Computadors, Univ. Politec. de Valencia, Valencia, Spain
  • fYear
    2012
  • fDate
    24-28 Sept. 2012
  • Firstpage
    320
  • Lastpage
    328
  • Abstract
    High Performance Computing usually leverages messaging libraries such as MPI or GASNet in order to exchange data among processes in large-scale clusters. Furthermore, these libraries make use of specialized low-level networking layers in order to retrieve as much performance as possible from hardware interconnects such as Infini Band or Myrinet, for example. EXTOLL is another emerging technology targeted for high performance clusters. These specialized low-level networking layers require some kind of flow control in order to prevent buffer overflows at the received side. In this paper we present a new flow control mechanism that is able to adapt the buffering resources used by a process according to the parallel application communication pattern and the varying activity among communicating peers. The tests carried out in a 64-node 1024-core EXTOLL cluster show that our new dynamic flow-control mechanism provides extraordinarily high buffer efficiency along with very low overhead, which is reduced between 8 and 10 times.
  • Keywords
    buffer storage; multiprocessing systems; software libraries; workstation clusters; EXTOLL; GASNet; Infini Band; MPI; Myrinet; buffering resources; data exchange; end-to-end flow-control mechanism; hardware interconnects; high performance computing clusters; large-scale clusters; low-level networking layers; messaging libraries; parallel application communication pattern; Engines; Libraries; Process control; Protocols; Random access memory; Receivers; Software; EXTOLL; MPI; RMA; VELO; dynamic flow-control; static flow-control;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing (CLUSTER), 2012 IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4673-2422-9
  • Type

    conf

  • DOI
    10.1109/CLUSTER.2012.15
  • Filename
    6337794