DocumentCode
2897487
Title
Using likely program invariants to detect hardware errors
Author
Sahoo, Swamp Kumar ; Li, Man-Lap ; Ramachandran, Pradeep ; Adve, Sarita V. ; Adve, V.S. ; Zhou, Yuanyuan
Author_Institution
Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL
fYear
2008
fDate
24-27 June 2008
Firstpage
70
Lastpage
79
Abstract
In the near future, hardware is expected to become increasingly vulnerable to faults due to continuously decreasing feature size. Software-level symptoms have previously been used to detect permanent hardware faults. However, they can not detect a small fraction of faults, which may lead to silent data corruptions(SDCs). In this paper, we present a system that uses invariants to improve the coverage and latency of existing detection techniques for permanent faults. The basic idea is to use training inputs to create likely invariants based on value ranges of selected program variables and then use them to identify faults at runtime. Likely invariants, however, can have false positives which makes them challenging to use for permanent faults. We use our on-line diagnosis framework for detecting false positives at runtime and limit the number of false positives to keep the associated overhead minimal. Experimental results using microarchitecture level fault injections in full-system simulation show 28.6% reduction in the number of undetected faults and 74.2% reduction in the number of SDCs over existing techniques, with reasonable overhead for checking code.
Keywords
fault diagnosis; software architecture; software fault tolerance; likely program invariants; microarchitecture level fault injections; on-line diagnosis framework; permanent faults; silent data corruptions; Checkpointing; Circuit faults; Computer errors; Delay; Detectors; Electrical fault detection; Fault detection; Hardware; Microprocessors; Runtime;
fLanguage
English
Publisher
ieee
Conference_Titel
Dependable Systems and Networks With FTCS and DCC, 2008. DSN 2008. IEEE International Conference on
Conference_Location
Anchorage, AK
Print_ISBN
978-1-4244-2397-2
Electronic_ISBN
978-1-4244-2398-9
Type
conf
DOI
10.1109/DSN.2008.4630072
Filename
4630072
Link To Document