Title :
Amoeba-Cache: Adaptive Blocks for Eliminating Waste in the Memory Hierarchy
Author :
Kumar, Sudhakar ; Hongzhou Zhao ; Shriraman, A. ; Matthews, E. ; Dwarkadas, Sandhya ; Shannon, Lesley
Abstract :
The fixed geometries of current cache designs do not adapt to the working set requirements of modern applications, causing significant inefficiency. The short block lifetimes and moderate spatial locality exhibited by many applications result in only a few words in the block being touched prior to eviction. Unused words occupy between 17-80% of a 64K L1 cache and between 1%-79% of a 1MB private LLC. This effectively shrinks the cache size, increases miss rate, and wastes on-chip bandwidth. Scaling limitations of wires mean that unused-word transfers comprise a large fraction (11%) of on-chip cache hierarchy energy consumption. We propose Amoeba-Cache, a design that supports a variable number of cache blocks, each of a different granularity. Amoeba-Cache employs a novel organization that completely eliminates the tag array, treating the storage array as uniform and morph able between tags and data. This enables the cache to harvest space from unused words in blocks for additional tag storage, thereby supporting a variable number of tags (and correspondingly, blocks). Amoeba-Cache adjusts individual cache line granularities according to the spatial locality in the application. It adapts to the appropriate granularity both for different data objects in an application as well as for different phases of access to the same data. Overall, compared to a fixed granularity cache, the Amoeba-Cache reduces miss rate on average (geometric mean) by 18% at the L1 level and by 18% at the L2 level and reduces L1-L2 miss bandwidth by ≃46%. Correspondingly, Amoeba-Cache reduces on-chip memory hierarchy energy by as much as 36% (mcf) and improves performance by as much as 50% (art).
Keywords :
cache storage; energy consumption; memory architecture; performance evaluation; power aware computing; Amoeba-Cache; adaptive blocks; block lifetimes; cache blocks; cache design; cache size; data access; memory hierarchy; miss rate reduction; on-chip bandwidth; on-chip cache hierarchy energy consumption; on-chip memory hierarchy energy reduction; performance improvement; private LLC; spatial locality; storage array; tag array elimination; tag storage; unused-word transfers; waste elimination; wire scaling limitations; Adaptive granularity; Amoeba-Cache; Cache architecture; Energy efficiency; Memory system;
Conference_Titel :
Microarchitecture (MICRO), 2012 45th Annual IEEE/ACM International Symposium on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4673-4819-5
DOI :
10.1109/MICRO.2012.42