Title :
A high-performance multiported L2 memory IP for scalable three-dimensional integration
Author :
Azarkhish, Erfan ; Loi, Igor ; Benini, Luca
Author_Institution :
Micrel Lab., Univ. of Bologna, Bologna, Italy
Abstract :
L2 memory, serving multiple clusters of tightly coupled processors, is well-suited for 3D integration, given its large required size and tolerance to latency and variations in memory access time. In this paper, we focus on the design of a synthesizable L2 memory IP component, which can be attached to a cluster-based multi-core platform through its NoC ports, and offer high-bandwidth memory access with low average latency. We propose a scalable 3D nonuniform memory access (NUMA) architecture, based on low latency logarithmic interconnects, which allows stacking of multiple memory layers with identical dies, supports multiple outstanding transactions, and achieves high clock frequencies due to its highly pipelined nature. Benchmark simulation results demonstrate that addition of 3D-NUMA to a multi-core NoC can result in an average performance boost of 34%. Physical synthesis results show that 3D-NUMA memory system can operate at 500 MHz in STMicroelec-tronics CMOS-28nm Low Power Technology (bounded by memory cut access time, while its logic components can operate up to 1 GHz), up to 8 layers (4 MB) with a memory density loss of only 16%.
Keywords :
CMOS memory circuits; logic design; low-power electronics; network-on-chip; three-dimensional integrated circuits; 3D integration; 3D-NUMA memory system; NoC ports; STMicroelectronics CMOS low power technology; clock frequencies; cluster-based multicore platform; frequency 500 MHz; high-bandwidth memory access; high-performance multiported L2 memory IP; logic components; low latency logarithmic interconnects; memory access time; memory cut access time; memory density loss; memory layers; multicore NoC; scalable 3D nonuniform memory access architecture; scalable three-dimensional integration; size 28 nm; synthesizable L2 memory IP component; tightly coupled processors; Bandwidth; Engines; Pipelines; Program processors; Random access memory; Stacking; Three-dimensional displays;
Conference_Titel :
3D Systems Integration Conference (3DIC), 2013 IEEE International
Conference_Location :
San Francisco, CA
DOI :
10.1109/3DIC.2013.6702347