Title :
Timing reconfigurable microarchitectures for power efficiency
Author :
Dhodapkar, Ashutosh S. ; Smith, James E.
Author_Institution :
Dept. of Electr. & Comput. Eng., Wisconsin Univ., Madison, WI, USA
Abstract :
Summary form only given. Modern microprocessors are designed with a fixed set of microarchitected resources. On the other hand, program resource requirements are known to vary across programs and even within a program as it goes through different phases of execution. This mismatch between the design and program requirements often leads to suboptimal power and/or performance. We present an adaptive microarchitecture that tailors itself dynamically to match changing program requirements. The goal of adaptation in this work is to achieve power efficiency without suffering significant performance degradation. The microarchitecture employs four multiconfiguration units nstruction cache, data cache, unified L2 cache and branch predictor along with light weight profiling hardware and sophisticated tuning algorithms. The profiling hardware collects working set signatures using a simple hash function and sparse sampling in order to reduce profiling hardware complexity. The tuning algorithms use these signatures to accurately detect program phase changes and decouple the tuning of each unit using a novel technique. Detailed simulations show that the best performing algorithm achieves subthreshold leakage power reductions as high as 76% for instruction cache, 46% for data cache, 63% for L2 cache and 73% for the branch predictor in the benchmarks studied. On average, the algorithm leads to 1.1% performance degradation while achieving 44%, 17%, 19%, and 28% reduction in subthreshold leakage power for instruction cache, data cache, L2 cache and branch predictor, respectively.
Keywords :
cache storage; instruction sets; parallel architectures; power consumption; reconfigurable architectures; branch predictor; data cache; hash function; instruction cache; microarchitecture tuning algorithm; microprocessor; power efficiency; program requirements; reconfigurable microarchitecture tuning; sparse sampling; unified L2 cache; Degradation; Design engineering; Hardware; Microarchitecture; Microprocessors; Phase detection; Power dissipation; Power engineering and energy; Power engineering computing; Sampling methods;
Conference_Titel :
Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
Print_ISBN :
0-7695-2132-0
DOI :
10.1109/IPDPS.2004.1303102