• DocumentCode
    667982
  • Title

    A high-performance multiported L2 memory IP for scalable three-dimensional integration

  • Author

    Azarkhish, Erfan ; Loi, Igor ; Benini, Luca

  • Author_Institution
    Micrel Lab., Univ. of Bologna, Bologna, Italy
  • fYear
    2013
  • fDate
    2-4 Oct. 2013
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    L2 memory, serving multiple clusters of tightly coupled processors, is well-suited for 3D integration, given its large required size and tolerance to latency and variations in memory access time. In this paper, we focus on the design of a synthesizable L2 memory IP component, which can be attached to a cluster-based multi-core platform through its NoC ports, and offer high-bandwidth memory access with low average latency. We propose a scalable 3D nonuniform memory access (NUMA) architecture, based on low latency logarithmic interconnects, which allows stacking of multiple memory layers with identical dies, supports multiple outstanding transactions, and achieves high clock frequencies due to its highly pipelined nature. Benchmark simulation results demonstrate that addition of 3D-NUMA to a multi-core NoC can result in an average performance boost of 34%. Physical synthesis results show that 3D-NUMA memory system can operate at 500 MHz in STMicroelec-tronics CMOS-28nm Low Power Technology (bounded by memory cut access time, while its logic components can operate up to 1 GHz), up to 8 layers (4 MB) with a memory density loss of only 16%.
  • Keywords
    CMOS memory circuits; logic design; low-power electronics; network-on-chip; three-dimensional integrated circuits; 3D integration; 3D-NUMA memory system; NoC ports; STMicroelectronics CMOS low power technology; clock frequencies; cluster-based multicore platform; frequency 500 MHz; high-bandwidth memory access; high-performance multiported L2 memory IP; logic components; low latency logarithmic interconnects; memory access time; memory cut access time; memory density loss; memory layers; multicore NoC; scalable 3D nonuniform memory access architecture; scalable three-dimensional integration; size 28 nm; synthesizable L2 memory IP component; tightly coupled processors; Bandwidth; Engines; Pipelines; Program processors; Random access memory; Stacking; Three-dimensional displays;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    3D Systems Integration Conference (3DIC), 2013 IEEE International
  • Conference_Location
    San Francisco, CA
  • Type

    conf

  • DOI
    10.1109/3DIC.2013.6702347
  • Filename
    6702347