• DocumentCode
    1200795
  • Title

    On the design of a high-performance adaptive router for CC-NUMA multiprocessors

  • Author

    Puente, Valentin ; Gregorio, Jose-Angel ; Beivide, Ramon ; Izu, Cruz

  • Author_Institution
    Univ. of Cantabria, Spain
  • Volume
    14
  • Issue
    5
  • fYear
    2003
  • fDate
    5/1/2003 12:00:00 AM
  • Firstpage
    487
  • Lastpage
    501
  • Abstract
    This work presents the design and evaluation of an adaptive packet router aimed at supporting CC-NUMA traffic. We exploit a simple and efficient packet injection mechanism to avoid deadlock, which leads to a fully, adaptive routing by employing only three virtual channels. In addition, we selectively use output buffers for implementing the most utilized virtual paths in order to reduce head-of-line blocking. The careful implementation of these features has resulted in a good trade-off between the network performance and hardware cost. The outcome of this research is a high-performance adaptive router (HPAR), which adequately balances the needs of parallel applications: minimal network latency at low loads and high throughput at heavy loads. The paper includes an evaluation process in which HPAR is compared with other adaptive routers using FIFO input bufferring, with or without additional virtual channels to reduce head-of-line blocking. This evaluation contemplates both the VLSI costs of each router and their performance under synthetic and real application workloads. To make the comparison fair, all the routers use the same efficient deadlock avoidance mechanism. In all the experiments, HPAR exhibited the best response among all the routers tested. Moreover, the observed packet latencies were comparable to those exhibited by simpler routers. Therefore, HPAR can be considered as a suitable candidate to implement packet interchange in next generations of CC-NUMA multiprocessors.
  • Keywords
    network routing; packet switching; parallel architectures; performance evaluation; shared memory systems; system recovery; CC-NUMA multiprocessors; adaptive routing; deadlock avoidance; hardware router design; head-of-line blocking; high-performance adaptive router; interconnection networks; packet interchange; shared memory multiprocessors; virtual channels; Computer Society; Computer architecture; Costs; Delay; Hardware; Multiprocessing systems; Routing; System recovery; Telecommunication traffic; Throughput;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/TPDS.2003.1199066
  • Filename
    1199066