• DocumentCode
    44328
  • Title

    Smart: Single-Cycle Multihop Traversals over a Shared Network on Chip

  • Author

    Krishna, Tushar ; Chen, Chia-Hsin Owen ; Woo-Cheol Kwon ; Li-Shiuan Peh

  • Author_Institution
    Massachusetts Inst. of Technol., Cambridge, MA, USA
  • Volume
    34
  • Issue
    3
  • fYear
    2014
  • fDate
    May-June 2014
  • Firstpage
    43
  • Lastpage
    56
  • Abstract
    As the number of on-chip cores increases, scalable on-chip topologies such as meshes inevitably add multiple hops to each network traversal. The best practice today is to design one-cycle routers, such that the low-load network latency between a source and destination is equal to the number of routers and links (that is, twice the hops) between them. Designers of operating systems, compilers, and cache coherence protocols often try to limit communication to within a few hops because on-chip latency is critical for their scalability. In this article, the authors propose an on-chip network called Smart (Single-cycle Multihop Asynchronous Repeated Traversal) that aims to present a single-cycle datapath all the way from the source to the destination. They do not add any additional fast physical express links in the datapath; instead, they drive the shared crossbars and links asynchronously up to multiple hops within a single cycle. They designed a router and link microarchitecture to achieve such a traversal, and a flow-control technique to arbitrate and set up multihop paths within a cycle. A place-and-route design at 45 nm achieves 11 hops within a 1-GHz cycle for paths without turns (9 hops for paths with turns). The authors observe 5 to 8 times reduction in low-load latencies across synthetic traffic patterns on an 8×8 chip multiprocessor, compared to a baseline one-cycle router network. Full-system simulations with Splash-2 and Parsec benchmarks demonstrate 27 and 52 percent reduction in runtime for private and shared level-2 designs, respectively.
  • Keywords
    integrated circuit design; network routing; network-on-chip; Parsec benchmarks; Smart on-chip network; Splash-2; chip multiprocessor; flow-control technique; low-load latencies; microarchitecture links; multihop paths; one-cycle router network; place-and-route design; router design; runtime reduction; shared network on chip; single-cycle datapath; single-cycle multihop asynchronous repeated traversal; synthetic traffic patterns; Delays; Digital communication; Ports (Computers); Repeaters; Spread spectrum communication; System-on-chip; Delays; Digital communication; Ports (Computers); Repeaters; Spread spectrum communication; System-on-chip; high performance computing; multicore processor; network on chip; networking; repeated link; virtualization;
  • fLanguage
    English
  • Journal_Title
    Micro, IEEE
  • Publisher
    ieee
  • ISSN
    0272-1732
  • Type

    jour

  • DOI
    10.1109/MM.2014.48
  • Filename
    6828577