DocumentCode
667982
Title
A high-performance multiported L2 memory IP for scalable three-dimensional integration
Author
Azarkhish, Erfan ; Loi, Igor ; Benini, Luca
Author_Institution
Micrel Lab., Univ. of Bologna, Bologna, Italy
fYear
2013
fDate
2-4 Oct. 2013
Firstpage
1
Lastpage
8
Abstract
L2 memory, serving multiple clusters of tightly coupled processors, is well-suited for 3D integration, given its large required size and tolerance to latency and variations in memory access time. In this paper, we focus on the design of a synthesizable L2 memory IP component, which can be attached to a cluster-based multi-core platform through its NoC ports, and offer high-bandwidth memory access with low average latency. We propose a scalable 3D nonuniform memory access (NUMA) architecture, based on low latency logarithmic interconnects, which allows stacking of multiple memory layers with identical dies, supports multiple outstanding transactions, and achieves high clock frequencies due to its highly pipelined nature. Benchmark simulation results demonstrate that addition of 3D-NUMA to a multi-core NoC can result in an average performance boost of 34%. Physical synthesis results show that 3D-NUMA memory system can operate at 500 MHz in STMicroelec-tronics CMOS-28nm Low Power Technology (bounded by memory cut access time, while its logic components can operate up to 1 GHz), up to 8 layers (4 MB) with a memory density loss of only 16%.
Keywords
CMOS memory circuits; logic design; low-power electronics; network-on-chip; three-dimensional integrated circuits; 3D integration; 3D-NUMA memory system; NoC ports; STMicroelectronics CMOS low power technology; clock frequencies; cluster-based multicore platform; frequency 500 MHz; high-bandwidth memory access; high-performance multiported L2 memory IP; logic components; low latency logarithmic interconnects; memory access time; memory cut access time; memory density loss; memory layers; multicore NoC; scalable 3D nonuniform memory access architecture; scalable three-dimensional integration; size 28 nm; synthesizable L2 memory IP component; tightly coupled processors; Bandwidth; Engines; Pipelines; Program processors; Random access memory; Stacking; Three-dimensional displays;
fLanguage
English
Publisher
ieee
Conference_Titel
3D Systems Integration Conference (3DIC), 2013 IEEE International
Conference_Location
San Francisco, CA
Type
conf
DOI
10.1109/3DIC.2013.6702347
Filename
6702347
Link To Document