DocumentCode
128990
Title
A multi banked — Multi ported — Non blocking shared L2 cache for MPSoC platforms
Author
Loi, Igor ; Benini, Luca
Author_Institution
DEI, Univ. of Bologna, Bologna, Italy
fYear
2014
fDate
24-28 March 2014
Firstpage
1
Lastpage
6
Abstract
On-chip L2 cache architectures, well established in high-performance parallel computing systems, are now becoming a performance-critical component also for multi/many-core architectures targeted at lower-power, embedded applications. The very stringent requirements on power and cost of these systems result in one of the key challenges in many-core designs, mandating the deployment of highly efficient L2 caches. In this perspective, sharing the L2 cache layer among all system cores has important advantages, such as increased utilization, fast inter-core communication, and reduced aggregate footprint because no undesired replication of lines occurs. This paper presents a novel architecture for a shared L2 cache system with multi-port and multi-bank features. We target this L2 cache to a many-core platform based on hierarchical cluster structure that does not employ private data caches, and therefore does not require complex coherency mechanisms. In fact, our shared L2 cache can be seen logically as a Last Level Cache (LLC) adopting the terminology of higher-performance many-core products, although in these latter the LLC is more often an L3 layer. Our experimental results show a maximum aggregate bandwidth of 28GB/s (89% of the maximum channel capacity) for 100% hit traffic with random banking conflicts, as a realistic case. Physical implementation results in 28nm Fully-Depleted-Silicon-on-Insulator (FDSoI) show that our L2 cache can operate at up to 1GHz with a memory density loss of only 20% with respect to an L2 scratchpad for a 2 MB configuration.
Keywords
cache storage; multiprocessing systems; silicon-on-insulator; system-on-chip; FDSoI; LLC; MPSoC platforms; aggregate bandwidth; fully-depleted-silicon-on-insulator; hierarchical cluster structure; hit traffic; inter-core communication; last level cache; many-core architectures; multibank features; multibanked shared L2 cache; multicore architectures; multiport features; multiported shared L2 cache; nonblocking shared L2 cache; on-chip L2 cache architectures; random banking conflicts; size 28 nm; Bandwidth; Banking; Benchmark testing; Computer architecture; Ports (Computers); Protocols; Random access memory;
fLanguage
English
Publisher
ieee
Conference_Titel
Design, Automation and Test in Europe Conference and Exhibition (DATE), 2014
Conference_Location
Dresden
Type
conf
DOI
10.7873/DATE.2014.093
Filename
6800294
Link To Document