DocumentCode :
2846404
Title :
Accurate and complexity-effective spatial pattern prediction
Author :
Chen, Chi F. ; Yang, Se-Hyun ; Falsafi, Babak ; Moshovos, Andreas
Author_Institution :
Comput. Archit. Lab., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
2004
fDate :
14-18 Feb. 2004
Firstpage :
276
Lastpage :
287
Abstract :
Recent research suggests that there are large variations in a cache´s spatial usage, both within and across programs. Unfortunately, conventional caches typically employ fixed cache line sizes to balance the exploitation of spatial and temporal locality, and to avoid prohibitive cache fill bandwidth demands. The resulting inability of conventional caches to exploit spatial variations leads to suboptimal performance and unnecessary cache power dissipation. We describe the spatial pattern predictor (SPP), a cost-effective hardware mechanism that accurately predicts reference patterns within a spatial group (i.e., a contiguous region of data in memory) at runtime. The key observation enabling an accurate, yet low-cost, SPP design is that spatial patterns correlate well with instruction addresses and data reference offsets within a cache line. We require only a small amount of predictor memory to store the predicted patterns. Simulation results for a 64-Kbyte 2-way set-associative Ll data cache with 64-byte lines show that: (1) a 256-entry tag-less direct-mapped SPP can achieve, on average, a prediction coverage of 95%, over-predicting the patterns by only 8%, (2) assuming a 70 nm process technology, the SPP helps reduce leakage energy in the base cache by 41% on average, incurring less than 1% performance degradation, and (3) prefetching spatial groups of up to 512 bytes using SPP improves execution time by 33% on average and up to a factor of two.
Keywords :
cache storage; pattern recognition; spatial data structures; cache line; cache´s spatial usage; data cache; hardware mechanism; instruction address; predictor memory; reference pattern; set-associative Ll data cache; spatial group; spatial pattern predictor; Bandwidth; CMOS technology; Computer architecture; Degradation; Delay; Design optimization; Energy dissipation; Laboratories; Power dissipation; Prefetching;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software, IEE Proceedings-
ISSN :
1530-0897
Print_ISBN :
0-7695-2053-7
Type :
conf
DOI :
10.1109/HPCA.2004.10010
Filename :
1410084
Link To Document :
بازگشت