• DocumentCode
    3079916
  • Title

    A fully-synthesizable single-cycle interconnection network for Shared-L1 processor clusters

  • Author

    Rahimi, Abbas ; Loi, Igor ; Kakoee, Mohammad Reza ; Benini, Luca

  • Author_Institution
    CSE Dept., Univ. of California, La Jolla, CA, USA
  • fYear
    2011
  • fDate
    14-18 March 2011
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Shared L1 memory is an interesting architectural option for building tightly-coupled multi-core processor clusters. We designed a parametric, fully combinational Mesh-of-Trees (MoT) interconnection network to support high-performance, single-cycle communication between processors and memories in L1-coupled processor clusters. Our interconnect IP is described in synthesizable RTL and it is coupled with a design automation strategy mixing advanced synthesis and physical optimization to achieve optimal delay, power, area (DPA) under a wide range of design constraints. We explore DPA for a large set of network configurations in 65nm technology. Post placement&routing delay is 38FO4 for a configuration with 8 processors and 16 32-bit memories (8×16); when the number of both processors and memories is increased by a factor of 4, the delay increases almost logarithmically, to 84FO4, confirming scalability across a significant range of configurations. DPA tradeoff flexibility is also promising: in comparison to the maxperformance 16×32 configuration, there is potential to save power and area by 45% and 12 % respectively, at the expense of 30% performance degradation.
  • Keywords
    multiprocessor interconnection networks; shared memory systems; L1-coupled processor clusters; combinational mesh-of-trees interconnection network; design automation strategy mixing advanced synthesis; design constraints; fully-synthesizable single-cycle interconnection network; high-performance single-cycle communication; interconnect IP; physical optimization; shared L1 memory; shared-L1 processor clusters; synthesizable RTL; tightly-coupled multicore processor clusters; Bandwidth; Clocks; Delay; Multiprocessor interconnection; Routing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Design, Automation & Test in Europe Conference & Exhibition (DATE), 2011
  • Conference_Location
    Grenoble
  • ISSN
    1530-1591
  • Print_ISBN
    978-1-61284-208-0
  • Type

    conf

  • DOI
    10.1109/DATE.2011.5763085
  • Filename
    5763085