Title :
Proximity-Aware cache Replication
Author :
Li, Chongmin ; Wang, Dongsheng ; Wang, Haixia ; Xue, Yibo ; Li, Jian
Author_Institution :
Tsinghua Nat. Lab. for Inf. Sci. & Technol., Beijing, China
fDate :
Jan. 30 2012-Feb. 2 2012
Abstract :
We propose Proximity-Aware cache Replication (PAR), an LLC replication technique that elegantly integrates an intelligent cache replication placement mechanism and a hierarchical directory-based coherence protocol into one cost-effective and scalable design. PAR dynamically allocates replicas of either shared or private data to a few predefined and fixed locations that are calculated at chip design time. Therefore, PAR fits well to future many-core CMPs thanks to its scalable on-chip storage and coherence design. Simulation results on a 64-core CMP show that PAR can achieve 12% speedup over the baseline shared cache design with SPLASH2 and PARSEC workloads. It also provides around 5% speedup over a couple contemporary approaches with much simpler and scalable support. Translating this speedup to cache performance, PAR achieves 40% and 70% reduction over the baseline in average L1 miss latency and on-chip network traffic, respectively. Furthermore, PAR shows good speedup with multiprogrammed workloads.
Keywords :
cache storage; LLC replication; PARSEC; SPLASH2; hierarchical directory-based coherence protocol; intelligent cache replication placement mechanism; many-core CMP; proximity-aware cache replication; Coherence; Hardware; Organizations; Protocols; System-on-a-chip; Tiles; Vectors;
Conference_Titel :
Design Automation Conference (ASP-DAC), 2012 17th Asia and South Pacific
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4673-0770-3
DOI :
10.1109/ASPDAC.2012.6165001