DocumentCode :
2380446
Title :
Exploiting producer patterns and L2 cache for timely dependence-based prefetching
Author :
Lim, Chungsoo ; Byrd, Gregory T.
Author_Institution :
North Carolina State Univ., Raleigh, NC
fYear :
2008
fDate :
12-15 Oct. 2008
Firstpage :
685
Lastpage :
692
Abstract :
This paper proposes an architecture that efficiently prefetches for loads whose effective addresses are directly dependent on previously-loaded values. This dependence-based prefetching scheme covers most frequently missed loads in programs that contain linked data structures (LDS). For timely prefetches, memory access patterns of producing loads are dynamically learned. These patterns (such as strides) are used to prefetch well ahead of the consumer load. The proposed prefetcher is placed near the processor core and targets L1 cache misses, because removing L1 cache misses has greater performance potential than removing L2 cache misses. We also examine how to capture pointers in LDS with pure hardware implementation. We find that the space requirement can be reduced, compared to previous work, if we selectively record patterns. Still, to make the prefetching scheme generally applicable, a large table is required for storing pointers. We show that storing the prefetch table in a partition of the L2 cache outperforms using the L2 cache conventionally.
Keywords :
storage management; L1 cache misses; L2 cache misses; linked data structures; memory access patterns; prefetch table; processor core; producer patterns; timely dependence-based prefetching; Clocks; Data structures; Delay; Hardware; History; Impedance; Parallel processing; Prefetching; Process design; Throughput;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Design, 2008. ICCD 2008. IEEE International Conference on
Conference_Location :
Lake Tahoe, CA
ISSN :
1063-6404
Print_ISBN :
978-1-4244-2657-7
Electronic_ISBN :
1063-6404
Type :
conf
DOI :
10.1109/ICCD.2008.4751935
Filename :
4751935
Link To Document :
بازگشت