DocumentCode :
2627033
Title :
Pinpointing data locality bottlenecks with low overhead
Author :
Xu Liu ; Mellor-Crummey, John
Author_Institution :
Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
fYear :
2013
fDate :
21-23 April 2013
Firstpage :
183
Lastpage :
193
Abstract :
A wide gap exists between the speed of modern processors and memory subsystems. As a result, long latencies associated with fetching data from memory often significantly degrade execution performance. To aid with program tuning, application developers need tools to analyze memory access patterns and guide them how to reuse data in the fastest levels of a system´s memory hierarchy. In this paper, we describe a novel, efficient, and effective tool for data locality measurement and analysis. Unlike other tools, our tool uses both statistical PMU sampling to quantify the cost of data locality bottlenecks and cache simulation to compute reuse distance to diagnose the causes of locality problems. This approach enables us to collect rich information to provide insight into a program´s data locality problems. Our tool attributes quantitative measurements of observed memory latency to program variables and dynamically allocated data, as well as code. Our tool identifies data touched by reuse pairs and the accesses involved, identified with their full calling context. Finally, our tool employs both sampling and parallelization to accelerate the computation of representative reuse distance information. Experiments show that with an overhead of only about 13%, our tool provides detailed insights that enabled us to make non-trivial improvements to memory-bound HPC benchmarks.
Keywords :
cache storage; sampling methods; storage management; cache simulation; data fetching; data locality bottleneck; data locality measurement; locality problem; memory access pattern analysis; memory latency; memory subsystem; processor; program tuning; quantitative measurement; statistical PMU sampling; Context; Distance measurement; Monitoring; Optimization; Phasor measurement units; Resource management; data locality; instruction-based sampling; program performance; reuse distance; software tools;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Performance Analysis of Systems and Software (ISPASS), 2013 IEEE International Symposium on
Conference_Location :
Austin, TX
Print_ISBN :
978-1-4673-5776-0
Electronic_ISBN :
978-1-4673-5778-4
Type :
conf
DOI :
10.1109/ISPASS.2013.6557169
Filename :
6557169
Link To Document :
بازگشت