DocumentCode :
1393171
Title :
A 45 nm Resilient Microprocessor Core for Dynamic Variation Tolerance
Author :
Bowman, Keith A. ; Tschanz, James W. ; Lu, Shih-Lien L. ; Aseron, Paolo A. ; Khellah, Muhammad M. ; Raychowdhury, Arijit ; Geuskens, Bibiche M. ; Tokunaga, Carlos ; Wilkerson, Chris B. ; Karnik, Tanay ; De, Vivek K.
Author_Institution :
Intel Corp., Hillsboro, OR, USA
Volume :
46
Issue :
1
fYear :
2011
Firstpage :
194
Lastpage :
208
Abstract :
A 45 nm microprocessor core integrates resilient error-detection and recovery circuits to mitigate the clock frequency (FCLK) guardbands for dynamic parameter variations to improve throughput and energy efficiency. The core supports two distinct error-detection designs, allowing a direct comparison of the relative trade-offs. The first design embeds error-detection sequential (EDS) circuits in critical paths to detect late timing transitions. In addition to reducing the Fclk guardbands for dynamic variations, the embedded EDS design can exploit path-activation rates to operate the microprocessor faster than infrequently-activated critical paths. The second error-detection design offers a less-intrusive approach for dynamic timing-error detection by placing a tunable replica circuit (TRC) per pipeline stage to monitor worst-case delays. Although the TRCs require a delay guardband to ensure the TRC delay is always slower than critical-path delays, the TRC design captures most of the benefits from the embedded EDS design with less implementation overhead. Furthermore, while core min-delay constraints limit the potential benefits of the embedded EDS design, a salient advantage of the TRC design is the ability to detect a wider range of dynamic delay variation, as demonstrated through low supply voltage (VCC) measurements. Both error-detection designs interface with error-recovery techniques, enabling the detection and correction of timing errors from fast-changing variations such as high-frequency VCC droops. The microprocessor core also supports two separate error-recovery techniques to guarantee correct execution even if dynamic variations persist. The first technique requires clock control to replay errant instructions at 1/2FCLK. In comparison, the second technique is a new multiple-issue instruction replay design that corrects errant instructions with a lower performance penalty and without requiring clock control. Silico- - n measurements demonstrate that resilient circuits enable a 41% throughput gain at equal energy or a 22% energy reduction at equal throughput, as compared to a conventional design when executing a benchmark program with a 10% VCC droop. In addition, the microprocessor includes a new adaptive clock control circuit that interfaces with the resilient circuits and a phase-locked loop (PLL) to track recovery cycles and adapt to persistent errors by dynamically changing Fclk f°Γ maximum efficiency.
Keywords :
circuit reliability; error detection; microprocessor chips; adaptive clock control circuit; clock frequency; critical path delay; dynamic delay variation; dynamic timing-error detection; dynamic variation tolerance; error-detection sequential circuit; error-recovery technique; path-activation rate; persistent error; phase-locked loop; recovery circuit; resilient circuit; resilient error-detection design; resilient microprocessor core; size 45 nm; tunable replica circuit; Clocks; Delay; Latches; Microprocessors; Phase locked loops; Pipelines; Resilient microprocessor; adaptive circuit; adaptive clocking; dynamic variation; error correction; error detection; error recovery; error-detection sequential circuit; multiple-issue instruction replay; resilient circuit; resilient design; timing error; tunable replica circuit; variation tolerance;
fLanguage :
English
Journal_Title :
Solid-State Circuits, IEEE Journal of
Publisher :
ieee
ISSN :
0018-9200
Type :
jour
DOI :
10.1109/JSSC.2010.2089657
Filename :
5654663
Link To Document :
بازگشت