DocumentCode :
2949684
Title :
Practical off-chip meta-data for temporal memory streaming
Author :
Wenisch, Thomas F. ; Ferdman, Michael ; Ailamaki, Anastasia ; Falsafi, Babak ; Moshovos, Andreas
Author_Institution :
Univ. of Michigan, Ann Arbor, MI
fYear :
2009
fDate :
14-18 Feb. 2009
Firstpage :
79
Lastpage :
90
Abstract :
Prior research demonstrates that temporal memory streaming and related address-correlating prefetchers improve performance of commercial server workloads through increased memory-level parallelism. Unfortunately, these prefetchers require large on-chip meta-data storage, making previously proposed designs impractical. Hence, to improve practicality, researchers have sought ways to enable timely prefetch while locating meta-data entirely off-chip. Unfortunately, current solutions for off-chip meta-data increase memory traffic by over a factor of three. We observe three requirements to store meta-data off chip: minimal off-chip lookup latency, bandwidth-efficient meta-data updates, and off-chip lookup amortized over many prefetches. In this work, we show: (1) minimal off-chip meta-data lookup latency can be achieved through a hardware-managed main memory hash table, (2) bandwidth-efficient updates can be performed through probabilistic sampling of meta-data updates, and (3) off-chip lookup costs can be amortized by organizing meta-data to allow a single lookup to yield long prefetch sequences. Using these techniques, we develop sampled temporal memory streaming (STMS), a practical address-correlating prefetcher that keeps predictor meta-data in main memory while achieving 90% of the performance potential of idealized on-chip meta-data storage.
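The three techniques named in the abstract can be illustrated with a small software model. The sketch below is not the STMS hardware design. It is a toy Python model under stated assumptions: a dict stands in for the main-memory hash table, a list stands in for the miss-address history, and the class name, method names, `sample_prob`, and `depth` are all hypothetical.

```python
import random


class SampledStreamPredictor:
    """Toy model of a sampled address-correlating predictor.

    Every name and parameter here is an illustrative assumption,
    not something taken from the paper.
    """

    def __init__(self, sample_prob=0.125, seed=0):
        self.sample_prob = sample_prob
        self.history = []      # miss-address log (stands in for off-chip history)
        self.index = {}        # address -> last sampled position in the log
        self.index_writes = 0  # counts index updates, a proxy for update traffic
        self._rng = random.Random(seed)

    def record_miss(self, addr):
        """Append a miss to the log; update the index only on sampled misses."""
        pos = len(self.history)
        self.history.append(addr)
        if self._rng.random() < self.sample_prob:
            self.index[addr] = pos
            self.index_writes += 1

    def lookup(self, addr, depth=4):
        """One index lookup yields up to `depth` addresses that followed addr."""
        pos = self.index.get(addr)
        if pos is None:
            return []
        return self.history[pos + 1 : pos + 1 + depth]


if __name__ == "__main__":
    predictor = SampledStreamPredictor(sample_prob=1.0)
    for address in [0x10, 0x20, 0x30, 0x40, 0x50]:
        predictor.record_miss(address)
    # A single lookup returns a multi-address prefetch sequence.
    print([hex(a) for a in predictor.lookup(0x20, depth=3)])
```

In this model, lowering `sample_prob` reduces the number of index writes at the cost of some lookups finding no entry, and a larger `depth` spreads the cost of one lookup over more prefetch candidates.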
Keywords :
memory architecture; storage management; table lookup; bandwidth-efficient meta-data update; hardware-managed main memory hash table; minimal off-chip lookup latency; off-chip lookup cost; off-chip meta-data storage; prefetching; probabilistic sampling; sampled temporal memory streaming; Bandwidth; Costs; Delay; Hardware; Organizing; Prefetching; Sampling methods; Table lookup; Throughput; Web server;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
High Performance Computer Architecture, 2009. HPCA 2009. IEEE 15th International Symposium on
Conference_Location :
Raleigh, NC
ISSN :
1530-0897
Print_ISBN :
978-1-4244-2932-5
Type :
conf
DOI :
10.1109/HPCA.2009.4798239
Filename :
4798239