DocumentCode :
1606620
Title :
Unified system level reliability evaluation methodology for multiprocessor Systems-on-Chip
Author :
Yamamoto, Alexandre Yasuo ; Ababei, Cristinel
Author_Institution :
Dept. of Electr. & Comput. Eng., North Dakota State Univ., Fargo, ND, USA
fYear :
2012
Firstpage :
1
Lastpage :
6
Abstract :
Reliability is a growing fundamental challenge in the design of multiprocessor Systems-on-Chip (MPSoCs). This trend is accelerated by the increasingly adverse process variations and wearout mechanisms that result in an increased number of errors. Previously proposed fault-tolerant techniques are ad-hoc and target processors or Networks-on-Chip (NoC) separately. Because each of these two units may become a reliability bottleneck for NoC based multiprocessor SoCs, it is imperative that the reliability of SoCs be evaluated and addressed in a unified manner, as a combination of communication and computational units. Using this holistic approach, in this paper, we propose a new architecture level unified reliability evaluation methodology for MPSoCs. At the core of the reliability estimation engine lies a Monte Carlo algorithm which works with failure times for time-dependent dielectric breakdown (TDDB) and negative bias temperature instability (NBTI) modeled as Weibull distributions. To demonstrate its usefulness, we utilize the proposed methodology to explore the impact of NoC router layout on the failure time of the system running the same set of benchmarks. In addition, we investigate the failure time of the system when the NoC as the communication unit of the MPSoC is taken or not - as in previous work - into consideration. Our simulation framework can be very helpful to architecture designers, who could use it to identify architectural characteristics and to develop design techniques meant to improve system´s lifetime.
Keywords :
Monte Carlo methods; Weibull distribution; electric breakdown; fault tolerant computing; integrated circuit layout; integrated circuit reliability; multiprocessing systems; network routing; network-on-chip; MPSoC; Monte Carlo algorithm; NBTI; NoC; NoC router layout impact; TDDB; Weibull distributions; architecture level unified reliability evaluation methodology; design techniques; fault-tolerant technique; multiprocessor system-on-chip; negative bias temperature instability; network-on-chip; process variations; reliability estimation engine; system failure time; system lifetime improvement; time-dependent dielectric breakdown; unified system level reliability evaluation methodology; wearout mechanisms; Benchmark testing; Computer architecture; Monte Carlo methods; Power demand; Program processors; Reliability; Tiles; lifetime; multiprocessor system-on-chip; network-on-chip; reliability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Green Computing Conference (IGCC), 2012 International
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4673-2155-6
Electronic_ISBN :
978-1-4673-2153-2
Type :
conf
DOI :
10.1109/IGCC.2012.6322282
Filename :
6322282
Link To Document :
بازگشت