DocumentCode
3457366
Title
DDCache: Decoupled and Delegable Cache Data and Metadata
Author
Hossain, Hemayet ; Dwarkadas, Sandhya ; Huang, Michael C.
Author_Institution
Univ. of Rochester, Rochester, NY, USA
fYear
2009
fDate
12-16 Sept. 2009
Firstpage
227
Lastpage
236
Abstract
In order to harness the full compute power of many-core processors, future designs must focus on effective utilization of on-chip cache and bandwidth resources. In this paper, we address the dual goals of (1) reducing on-chip communication overheads and (2) improving on-chip cache space utilization resulting in larger effective cache capacity and thereby potentially reduced off-chip traffic. We present a new cache coherence protocol that decouples the logical binding between data and metadata in a cache set. This decoupling allows data and metadata for a cache line to be independently delegated to any location on chip. By delegating metadata to the current owner/modifier of a cache line, communication overhead for metadata maintenance is avoided and communication can be effectively localized between interacting processes. By decoupling metadata from data, data space in the cache can be more efficiently utilized by avoiding unnecessary data replication. Using full system simulation, we demonstrate that our decoupled protocol achieves an average (geometric mean) speedup of 1.24 (1.3 with microbenchmarks) compared to a base statically mapped directory-based non-uniform cache access protocol, while generating only 65% and 74% of the on-chip and off-chip traffic respectively, and consuming 74% of the corresponding energy (95% of the power) in the on-chip memory and interconnect compared to the base system.
Keywords
cache storage; microprocessor chips; multiprocessing systems; DDCache; bandwidth resources; cache capacity; cache coherence protocol; cache set; data space; decoupled cache data; decoupled protocol; decoupling metadata; delegable cache data; directory-based nonuniform cache access protocol; geometric mean; logical binding; many-core processor; meta data; metadata maintenance; off-chip traffic; on-chip cache space utilization; on-chip communication overhead; unnecessary data replication; Access protocols; Bandwidth; Concurrent computing; Parallel architectures; Power generation; Power system interconnection; Process design; Solid modeling; System-on-a-chip; Traffic control; Cache Coherence; Chip Multiprocessors; DDCache; Decoupled Cache; Delegable Cache;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Architectures and Compilation Techniques, 2009. PACT '09. 18th International Conference on
Conference_Location
Raleigh, NC
ISSN
1089-795X
Print_ISBN
978-0-7695-3771-9
Type
conf
DOI
10.1109/PACT.2009.24
Filename
5260541
Link To Document