DocumentCode :
3055625
Title :
Utilizing Dynamically Coupled Cores to Form a Resilient Chip Multiprocessor
Author :
LaFrieda, Christopher ; Ipek, Engin ; Martínez, José F. ; Manohar, Rajit
Author_Institution :
Cornell Univ., Ithaca
fYear :
2007
fDate :
25-28 June 2007
Firstpage :
317
Lastpage :
326
Abstract :
Aggressive CMOS scaling will make future chip multiprocessors (CMPs) increasingly susceptible to transient faults, hard errors, manufacturing defects, and process variations. Existing fault-tolerant CMP proposals that implement dual modular redundancy (DMR) do so by statically binding pairs of adjacent cores via dedicated communication channels and buffers. This can result in unnecessary power and performance losses in cases where one core is defective (in which case the entire DMR pair must be disabled), or when cores exhibit different frequency/leakage characteristics due to process variations (in which case the pair runs at the speed of the slowest core). Static DMR also hinders power density/thermal management, as DMR pairs running code with similar power/thermal characteristics are necessarily placed next to each other on the die. We present dynamic core coupling (DCC), an architectural technique that allows arbitrary CMP cores to verify each other´s execution while requiring no static core binding at design time or dedicated communication hardware. Our evaluation shows that the performance overhead of DCC over a CMP without fault tolerance is 3% on SPEC2000 benchmarks, and is within 5% for a set of scalable parallel scientific and data mining applications with up to eight threads (16 processors). Our results also show that DCC has the potential to significantly outperform existing static DMR schemes.
Keywords :
CMOS integrated circuits; microprocessor chips; CMOS; dual modular redundancy; dynamically coupled cores; resilient chip multiprocessor; CMOS process; Communication channels; Energy management; Fault tolerance; Frequency; Manufacturing processes; Performance loss; Proposals; Redundancy; Thermal management;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Dependable Systems and Networks, 2007. DSN '07. 37th Annual IEEE/IFIP International Conference on
Conference_Location :
Edinburgh
Print_ISBN :
0-7695-2855-4
Type :
conf
DOI :
10.1109/DSN.2007.100
Filename :
4272983
Link To Document :
بازگشت