DocumentCode :
3084144
Title :
Surviving to Errors in Multi-Core Environments
Author :
Vera, Xavier ; Abella, Jaume
Author_Institution :
Intel Labs - UPC, Barcelona
fYear :
2007
fDate :
8-11 July 2007
Firstpage :
260
Lastpage :
260
Abstract :
In this paper, the authors present a global view of the issues outlined above as well as some directions to address them. First, the most important sources of failure (SOF) are presented as well as their impact on CMOS technology. Then, techniques and key parameters to measure degradation due to different SOF are introduced and microarchitectural approaches to mitigate degradation are outlined. The problem of error detection and anticipation is illustrated as well as pros and cons of different types of mechanisms to perform such detection and anticipation. Finally, we illustrate the whole picture where performance and reliability must be traded carefully. We point out some directions to use the information about the detected errors and the amount of degradation of each component to configure the multi-core in such a way that performance is maximized without compromising reliability.
Keywords :
CMOS integrated circuits; error detection; failure analysis; integrated circuit reliability; CMOS technology; error anticipation; error detection; microarchitectural approaches; multi-core environments; sources of failure; CMOS technology; Computer vision; Costs; Current density; Degradation; Electromigration; Face detection; Niobium compounds; Power system reliability; Titanium compounds;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
On-Line Testing Symposium, 2007. IOLTS 07. 13th IEEE International
Conference_Location :
Crete
Print_ISBN :
0-7695-2918-6
Type :
conf
DOI :
10.1109/IOLTS.2007.65
Filename :
4274863
Link To Document :
بازگشت