DocumentCode :
1272385
Title :
Experience with Improving Distributed Shared Cache Performance on Tilera´s Tile Processor
Author :
Choi, Inseok ; Zhao, Minshu ; Yang, Xu ; Yeung, Donald
Author_Institution :
Univ. of Maryland, College Park, MD, USA
Volume :
10
Issue :
2
fYear :
2011
Firstpage :
45
Lastpage :
48
Abstract :
This paper describes our experience with profiling and optimizing physical locality for the distributed shared cache (DSC) in Tilera´s Tile multicore processor. Our approach uses the Tile Processor´s hardware performance measurement counters (PMCs) to acquire page-level access pattern profiles. A key problem we address is imprecise PMC interrupts. Our profiling tools use binary analysis to correct for interrupt ``skid,´´ thus pinpointing individual memory operations that incur remote DSC slice references and permitting us to sample their access patterns. We use our access pattern profiles to drive page homing optimizations for both heap and static data objects. Our experiments show we can improve physical locality for 5 out of 11 SPLASH2 benchmarks running on 32 cores, enabling 32.9%--77.9% of DSC references to target the local DSC slice. To our knowledge, this is the first work to demonstrate page homing optimizations on a real system.
Keywords :
cache storage; microprocessor chips; multiprocessing systems; PMC interrupt; Tilera tile multicore processor; binary analysis; distributed shared cache performance; hardware performance measurement counters; page homing optimization; page-level access pattern profile; profiling tool; Benchmark testing; Computer architecture; Data streams; Design methodology; Multicore processing; Multiprocessing systems; Design studies; Multi-core/single-chip multiprocessors; Multiple Data Stream Architectures (Multiprocessors);
fLanguage :
English
Journal_Title :
Computer Architecture Letters
Publisher :
ieee
ISSN :
1556-6056
Type :
jour
DOI :
10.1109/L-CA.2011.18
Filename :
5953732
Link To Document :
بازگشت