Title :
Improving VLIW Processor Performance Using Three-Dimensional (3D) DRAM Stacking
Author :
Pan, Yangyang ; Zhang, Tong
Author_Institution :
ECSE Dept., Rensselaer Polytech. Inst., Troy, NY, USA
Abstract :
This work studies the potential of using emerging 3D integration to improve embedded VLIW computing system. We focus on the 3D integration of one VLIW processor die with multiple high-capacity DRAM dies. Our proposed memory architecture employs 3D stacking technology to bond one die containing several processing clusters to multiple DRAM dies for a primary memory. The 3D technology also enables wide low-latency buses between clusters and memory and enable the latency of 3D DRAM L2 cache comparable to 2D SRAM L2 cache. These enable it to replace the 2D SRAM L2 cache with 3D DRAM L2 cache. The die area for 2D SRAM L2 cache can be re-allocated to additional clusters that can improve the performance of the system. From the simulation results, we find 3D stacking DRAM main memory can improve the system performance by 10%~80% than 2D off-chip DRAM main memory depending on different benchmarks. Also, for a similar logic die area, a four clusters system with 3D DRAM L2 cache and 3D DRAM main memory outperforms a two clusters system with 2D SRAM L2 cache and 3D DRAM main memory by about 10%.
Keywords :
DRAM chips; SRAM chips; cache storage; instruction sets; memory architecture; multiprocessing systems; parallel architectures; parallel machines; 2D SRAM L2 cache; 3D DRAM L2 cache; 3D DRAM stacking; embedded VLIW computing system; low-latency buses; memory architecture; Computer architecture; Delay; Digital signal processing; Embedded computing; Parallel processing; Random access memory; Stacking; System performance; USA Councils; VLIW; 3D DRAM; DSP; VLIW;
Conference_Titel :
Application-specific Systems, Architectures and Processors, 2009. ASAP 2009. 20th IEEE International Conference on
Conference_Location :
Boston, MA
Print_ISBN :
978-0-7695-3732-0
Electronic_ISBN :
2160-0511
DOI :
10.1109/ASAP.2009.11