Title :
Exploiting data flow information in algorithm-based fault tolerance
Author :
Shamsunder, Ragini ; Rosenkrantz, Daniel J. ; Ravi, S.S.
Author_Institution :
Dept. of Comput. Sci., Albany Univ., NY, USA
Abstract :
The authors examine the use of data flow information in algorithm-based fault tolerance for multiprocessor computations. They show that this information can lead to a more efficient design, by reducing the number of checks. They investigate both the analysis problem of determining the fault tolerance measures of a given system, and the design problem of constructing compact check sets to detect or locate a given number of faults. Specifically, it is shown that the analysis problem can be solved efficiently when the number of faults is fixed. The computational difficulty of this problem when the number of faults is not fixed is addressed. The authors consider the design problem for special classes of data flow graphs and establish upper and lower bounds on the number of checks needed to detect or locate a given number of faults.
Keywords :
fault tolerant computing; algorithm-based fault tolerance; compact check sets; computational difficulty; data flow graphs; data flow information exploitation; lower bounds; multiprocessor computations; upper bounds; Algorithm design and analysis; Computer science; Data flow computing; Fault detection; Fault location; Fault tolerance; Fault tolerant systems; Flow graphs; Multiprocessing systems; Polynomials;
Conference_Titel :
Fault-Tolerant Computing, 1993. FTCS-23. Digest of Papers., The Twenty-Third International Symposium on
Conference_Location :
Toulouse, France
Print_ISBN :
0-8186-3680-7
DOI :
10.1109/FTCS.1993.627331