DocumentCode
2480459
Title
Embedded way prediction for last-level caches
Author
Sleiman, Faissal M. ; Dreslinski, Ronald G. ; Wenisch, Thomas F.
Author_Institution
Electr. Eng. & Comput. Sci., Univ. of Michigan, Ann Arbor, MI, USA
fYear
2012
fDate
Sept. 30 2012-Oct. 3 2012
Firstpage
167
Lastpage
174
Abstract
This paper investigates Embedded Way Prediction for large last-level caches (LLCs): an architecture and circuit design to provide the latency of parallel tag-data access at substantial energy savings. Existing way prediction approaches for L1 caches are compromised by the high associativity and filtered temporal locality of LLCs. We demonstrate: (1) the need for wide partial tag comparison, which we implement with a dynamic CAM alongside the data sub-array wordline decode, and (2) the inhibit bit, an architectural innovation to provide accurate predictions when the partial tag comparison is inconclusive. We present circuit critical-path and architectural power/performance studies demonstrating speedups of up to 15.4% (6.6% average) for scientific and server applications, matching the performance of parallel tag-data access while reducing energy overhead by 40%.
Keywords
cache storage; embedded systems; L1 caches; LLC; architectural innovation; circuit critical-path; circuit design; data subarray wordline decode; dynamic CAM; embedded way prediction; energy overhead reduction; last-level caches; parallel tag-data access; partial tag comparison; temporal locality; wide partial tag comparison; Arrays; Computer aided manufacturing; Decoding; Delay; Random access memory; Servers; Tiles;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Design (ICCD), 2012 IEEE 30th International Conference on
Conference_Location
Montreal, QC
ISSN
1063-6404
Print_ISBN
978-1-4673-3051-0
Type
conf
DOI
10.1109/ICCD.2012.6378636
Filename
6378636
Link To Document