DocumentCode
3013341
Title
Algorithm based fault tolerance versus result-checking for matrix computations
Author
Prata, Paula ; Silva, João Gabriel
Author_Institution
Dept. Math./Inf., Universidade da Beira Interior, Covilhã, Portugal
fYear
1999
fDate
15-18 June 1999
Firstpage
4
Lastpage
11
Abstract
Algorithm Based Fault Tolerance (ABFT) is the collective name for a set of techniques used to determine the correctness of some mathematical calculations. A less well-known alternative is Result Checking (RC) where, contrary to ABFT, results are checked without knowledge of the particular algorithm used to calculate them. In this paper, the two are compared using some practical implementations of matrix computations. The criteria are performance and memory overhead, ease of use, and error coverage. For the latter, extensive error injection experiments were made. To the best of our knowledge, this is the first time that RC has been validated by fault injection. We conclude that Result Checking has the important advantage of being independent of the underlying algorithm. It also generally has less performance overhead than ABFT, and the two techniques are essentially equivalent in terms of error coverage.
Keywords
error detection; fault tolerant computing; matrix algebra; ABFT; Result Checking; error coverage; fault injection; fault tolerance; matrix computations; result-checking; Computer architecture; Electrical capacitance tomography; Electronic switching systems; Error correction; Fault detection; Fault tolerance; Mathematics; Protection; Research and development; Systolic arrays
fLanguage
English
Publisher
IEEE
Conference_Titel
Fault-Tolerant Computing, 1999. Digest of Papers. Twenty-Ninth Annual International Symposium on
Conference_Location
Madison, WI, USA
ISSN
0731-3071
Print_ISBN
0-7695-0213-X
Type
conf
DOI
10.1109/FTCS.1999.781028
Filename
781028