Title :
Comparison of Shared and Private L1 Data Memories for an Embedded MPSoC in 28nm FD-SOI
Author :
Gregor Sievers;Julian Daberkow;Johannes Ax;Martin Flasskamp;Wayne Kelly;Thorsten Jungeblut;Mario Porrmann; Rückert
Author_Institution :
Cognitronics &
Abstract :
Tightly coupling CPUs within clusters allows for low latency, high bandwidth communication in MPSoCs. Our CoreVA-MPSoC integrates multiple clusters using an on-chip network. In this work we introduce a shared L1 data memory that can be accessed by all CPUs of a cluster with low latency. A Mesh-of-Trees (MoT) and a crossbar as topology for the shared memory interconnect are presented. A cluster that integrates the shared L1 memory is compared with an architecture that features an AXI interconnect and a local L1 data memory for each CPU. In addition, we consider an architecture that integrates both. We present implementation results using a 28nm FD-SOI standard cell technology. The shared L1 memory shows similar area results compared to the local memory architecture. Place and route results of a cluster with 8 CPUs, 128kB local-and 128kB shared L1 data memory divided into 16 memory banks show a frequency of 728MHz and an area of 1.77mm2. To map programs to the different CPU cluster configurations a compiler for streaming applications is used. An architecture with both local and shared L1 data memory and 4 memory banks shows best performance results in combination with a high resource efficiency.
Keywords :
"Clocks","Memory management","Central Processing Unit","Standards","Routing","Topology"
Conference_Titel :
Embedded Multicore/Many-core Systems-on-Chip (MCSoC), 2015 IEEE 9th International Symposium on
DOI :
10.1109/MCSoC.2015.25