Title :
AREP: Adaptive Resource Efficient Prefetching for Maximizing Multicore Performance
Author :
Muneeb Khan;Michael A. Laurenzanoy;Jason Marsy;Erik Hagersten;David Black-Schaffer
Author_Institution :
Uppsala Univ. &
Abstract :
Modern processors widely use hardware prefetching to hide memory latency. While aggressive hardware prefetchers can improve performance significantly for some applications, they can limit the overall performance in highly-utilized multicore processors by saturating the offchip bandwidth and wasting last-level cache capacity. Co-executing applications can slowdown due to contention over these shared resources. This work introduces Adaptive Resource Efficient Prefetching (AREP) -- a runtime framework that dynamically combines software prefetching and hardware prefetching to maximize throughput in highly utilized multicore processors. AREP achieves better performance by prefetching data in a resource efficient way -- conserving offchip-bandwidth and last-level cache capacity with accurate prefetching and by applying cache-bypassing when possible. AREP dynamically explores a mix of hardware/software prefetching policies, then selects and applies the best performing policy. AREP is phase-aware and re-explores (at runtime) for the best prefetching policy at phase boundaries. A multitude of experiments with workload mixes and parallel applications on a modern high performance multicore show that AREP can increase throughput by up to 49% (8.1% on average). This is complemented by improved fairness, resulting in average quality of service above 94%.
Keywords :
"Prefetching","Hardware","Runtime","Bandwidth","Multicore processing"
Conference_Titel :
Parallel Architecture and Compilation (PACT), 2015 International Conference on
DOI :
10.1109/PACT.2015.35