DocumentCode :
1583895
Title :
Fault-tolerant parallel processing with real-time error detection and recovery
Author :
Chen, Chung-Ho ; Somani, Arun K.
Author_Institution :
Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
fYear :
1992
Firstpage :
994
Abstract :
Presents a unique error recovery mechanism based on new cache states, verified and non-verified, to detect and recover errors produced by the processor or cache memory or both due to transient faults. The scheme remedies the insufficiency of the error-correcting code when faced with processor transient faults. This cache-based recovery method not only recovers errors in a local cache memory but also prevents the propagation of error to other caches. It is shown that this new error recovery scheme can be easily integrated with existing cache coherency protocols
Keywords :
fault tolerant computing; parallel processing; real-time systems; system recovery; cache coherency protocols; cache memory; cache states; error recovery mechanism; error-correcting code; real-time error detection; transient faults; Cache memory; Electrical fault detection; Error correction codes; Fault detection; Fault tolerance; Fault tolerant systems; Parallel processing; Protocols; Real time systems; Reliability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signals, Systems and Computers, 1992. 1992 Conference Record of The Twenty-Sixth Asilomar Conference on
Conference_Location :
Pacific Grove, CA
ISSN :
1058-6393
Print_ISBN :
0-8186-3160-0
Type :
conf
DOI :
10.1109/ACSSC.1992.269071
Filename :
269071
Link To Document :
بازگشت