DocumentCode :
301049
Title :
Experimental evaluation of the fail-silent behaviour in programs with consistency checks
Author :
Rela, Mário Zenha ; Madeira, Henrique ; Silva, João Gabriel
Author_Institution :
Dept. of Eng. Inf., Coimbra Univ., Portugal
fYear :
1996
fDate :
25-27 Jun 1996
Firstpage :
394
Lastpage :
403
Abstract :
An important research topic deals with the investigation of whether a non-duplicated computer can be made fail-silent, since that behaviour is a-priori assumed in many algorithms. However, previous research has shown that in systems using a simple behaviour based error detection mechanism invisible to the programmer (e.g. memory protection), the percentage of fail-silent violations could be higher than 10%. Since the study of these errors has shown that they were mostly caused by pure data errors, we evaluate the effectiveness of software techniques capable of checking the semantics of the data, such as assertions, to detect these remaining errors. The results of injecting physical pin-level faults show that these tests can prevent about 40% of the fail-silent model violations that escape the simple hardware-based error detection techniques. In order to decouple the intrinsic limitations of the tests used from other factors that might affect its error detection capabilities, we evaluated a special class of software checks known for its high theoretical coverage: algorithm based fault tolerance (ABFT). The analysis of the remaining errors showed that most of them remained undetected due to short range control flow errors. When very simple software-based control flow checking was associated to the semantic tests, the target system, without any dedicated error detection hardware, behaved according to the fail-silent model for about 98% of all the faults injected
Keywords :
data flow analysis; data integrity; program debugging; program diagnostics; program verification; software fault tolerance; algorithm based fault tolerance; behaviour based error detection; consistency checking; control flow checking; data semantics checking; fail-silent computer; fail-silent program behaviour; hardware-based error detection; memory protection; nonduplicated computer; semantic tests; short range control flow errors; software verification; Computer errors; Error correction; Fault detection; Fault tolerance; Hardware; Programming profession; Protection; Software algorithms; Software testing; System testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fault Tolerant Computing, 1996., Proceedings of Annual Symposium on
Conference_Location :
Sendai
ISSN :
0731-3071
Print_ISBN :
0-8186-7262-5
Type :
conf
DOI :
10.1109/FTCS.1996.534625
Filename :
534625
Link To Document :
بازگشت