DocumentCode :
166158
Title :
Modeling cache coherence misses on multicores
Author :
Xiaoyue Pan ; Jonsson, Bengt
Author_Institution :
Dept. of Inf. Technol., Uppsala Univ., Uppsala, Sweden
fYear :
2014
fDate :
23-25 March 2014
Firstpage :
96
Lastpage :
105
Abstract :
While maintaining the coherency of private caches, invalidation-based cache coherence protocols introduce cache coherence misses. We address the problem of predicting the number of cache coherence misses in the private cache of a parallel application when running on a multicore system with an invalidation-based cache coherence protocol. We propose three new performance models (uniform, phased and symmetric) for estimating the number of coherence misses from information about inter-core data sharing patterns and the individual core´s data reuse patterns. The inputs to the uniform and phased models are the write frequency and reuse distance distribution of shared data from different cores. This input can be obtained either from profiling the target application on a single core or by analyzing the data access pattern statically, and does not need a detailed simulation of the pattern of interleaving accesses to shared data. The output of the models is an estimated number of coherence misses of the target application. The output can be combined with the number of other kinds of misses to estimate the total number of misses in each core´s private cache. This output can also be used to guide program optimization to improve cache performance. We evaluate our models with a set of benchmarks from the PARSEC benchmark suite on real hardware.
Keywords :
cache storage; data handling; multiprocessing systems; protocols; PARSEC benchmark suite; cache coherence misses; cache performance improvement; core data reuse patterns; data access pattern; intercore data sharing patterns; invalidation-based cache coherence protocols; multicore system; phased performance models; private cache coherency; program optimization; symmetric performance models; uniform performance models; Benchmark testing; Coherence; Data models; Instruction sets; Multicore processing; Predictive models; Protocols;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Performance Analysis of Systems and Software (ISPASS), 2014 IEEE International Symposium on
Conference_Location :
Monterey, CA
Print_ISBN :
978-1-4799-3604-5
Type :
conf
DOI :
10.1109/ISPASS.2014.6844465
Filename :
6844465
Link To Document :
بازگشت