DocumentCode :
2615648
Title :
Fault-tolerant parallel matrix multiplication with one iteration fault detection latency
Author :
Hong, Chul-Eui ; McMillin, Bruce M.
Author_Institution :
Dept. of Comput. Sci., Missouri Univ., Rolla, MO, USA
fYear :
1991
fDate :
11-13 Sep 1991
Firstpage :
665
Lastpage :
672
Abstract :
A new algorithm, the ID algorithm, is presented which minimizes the fault-detection latency. In the ID algorithm, a fault is detected as soon as the fault occurs instead of at problem termination. For n² processors, the fault-latency time of the ID algorithm is 1/n of that of the checksum algorithm with a run-time penalty of O(n log2 n) in an n × n matrix operation. This algorithm has better performance in terms of error coverage and expected run time in large-scale matrix multiplications such as signal and image processing, weather prediction, and finite-element analysis.
Keywords :
fault tolerant computing; parallel algorithms; software reliability; ID algorithm; checksum algorithm; error coverage; fault tolerant parallel matrix multiplication; fault-detection latency; finite-element analysis; image processing; large-scale matrix multiplications; one iteration fault detection latency; run-time penalty; signal processing; weather prediction; Delay; Fault detection; Fault tolerance; Finite element methods; Image analysis; Image processing; Large-scale systems; Runtime; Signal processing; Weather forecasting;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Computer Software and Applications Conference, 1991. COMPSAC '91., Proceedings of the Fifteenth Annual International
Conference_Location :
Tokyo
Print_ISBN :
0-8186-2152-4
Type :
conf
DOI :
10.1109/CMPSAC.1991.170258
Filename :
170258