Title :
Techniques for bandwidth-efficient prefetching of linked data structures in hybrid prefetching systems
Author :
Ebrahimi, Eiman ; Mutlu, Onur ; Patt, Yale N.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Texas at Austin, Austin, TX
Abstract :
Linked data structure (LDS) accesses are critical to the performance of many large scale applications. Techniques have been proposed to prefetch such accesses. Unfortunately, many LDS prefetching techniques 1) generate a large number of useless prefetches, thereby degrading performance and bandwidth efficiency, 2) require significant hardware or storage cost, or 3) when employed together with stream-based prefetchers, cause significant resource contention in the memory system. As a result, existing processors do not employ LDS prefetchers even though they commonly employ stream-based prefetchers. This paper proposes a low-cost hardware/software cooperative technique that enables bandwidth-efficient prefetching of linked data structures. Our solution has two new components: 1) a compiler-guided prefetch filtering mechanism that informs the hardware about which pointer addresses to prefetch, 2) a coordinated prefetcher throttling mechanism that uses run-time feedback to manage the interference between multiple prefetchers (LDS and stream-based) in a hybrid prefetching system. Evaluations show that the proposed solution improves average performance by 22.5% while decreasing memory bandwidth consumption by 25% over a baseline system that employs an effective stream prefetcher on a set of memory- and pointer-intensive applications. We compare our proposal to three different LDS/correlation prefetching techniques and find that it provides significantly better performance on both single-core and multi-core systems, while requiring less hardware cost.
Keywords :
data structures; storage management; bandwidth-efficient prefetching; compiler-guided prefetch filtering mechanism; hardware-software cooperative technique; hybrid prefetching systems; linked data structures; memory system; stream-based prefetchers; Bandwidth; Costs; Data structures; Degradation; Feedback; Filtering; Hardware; Large-scale systems; Prefetching; Runtime;
Conference_Titel :
High Performance Computer Architecture, 2009. HPCA 2009. IEEE 15th International Symposium on
Conference_Location :
Raleigh, NC
Print_ISBN :
978-1-4244-2932-5
DOI :
10.1109/HPCA.2009.4798232