DocumentCode :
127146
Title :
Life after failure
Author :
Bidokhti, Nematollah ; Tehranipoor, Mohammad ; Jifeng Chen ; Lee, Jeyull
fYear :
2014
fDate :
27-30 Jan. 2014
Firstpage :
1
Lastpage :
7
Abstract :
This paper discusses the challenges, strategy and mitigation methods to prolong the operation of a product even after it is either about to have a failure or have experienced actual failures. The focus is mostly on two component types ASICs and memories. The aging and failure mitigation techniques vary by component type. Design for reliability techniques so far have been focused on measuring the impact of failures and how to prevent them as much as possible. The current advances in DFR methodologies are reaching their maximum level of effectiveness, still ASIC and memory failures are occurring impacting service operation of many products in the industry. Today´s technologies and their challenges require a different approach to addressing failures, which is the ability to have real time self-healing capability. The methodologies that have been developed in the area of self-healing have shown a promising future in the product overall fault management where failures due to aging ASICs and failing memories can be addressed while the system is in operation. This capability allows equipment providers to maintain their customer´s confidence in their ability to design, develop and deliver robust products and allow continuous system operation in the presence of failures. In this paper, we discuss the fundamental and phenomenon of aging, modeling and simulation, correlation between simulation and actual devices, sensor designs, mitigation strategies, impact on system reliability and availability and the reliability prediction.
Keywords :
fault diagnosis; maintenance engineering; reliability; ASIC; DFR methodologies; aging techniques; application-specific integrated circuits; failure mitigation techniques; memories; product failure; product operation; product overall fault management; reliability prediction; system availability; system reliability; Aging; Degradation; Delays; Human computer interaction; Logic gates; Random access memory; Reliability; ASIC; Aging; Hardware; Memory; Reliability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Reliability and Maintainability Symposium (RAMS), 2014 Annual
Conference_Location :
Colorado Springs, CO
Print_ISBN :
978-1-4799-2847-7
Type :
conf
DOI :
10.1109/RAMS.2014.6798522
Filename :
6798522
Link To Document :
بازگشت