Title :
Cache-aided rollback error recovery (CARER) algorithm for shared-memory multiprocessor systems
Author :
Ahmed, R.E. ; Frazier, R.C. ; Marinos, P.N.
Author_Institution :
Dept. of Electr. Eng., Duke Univ., Durham, NC, USA
Abstract :
Three cache-aided error-recovery algorithms for use in shared-memory multiprocessor systems are presented. They rely on hardware and specially designed cache memory for all their soft error management operations and can be easily incorporated into existing cache-coherence protocols. An example illustrating their use in a multiprocessor system employing Dragon as its cache-coherence protocol is given, and the results of a tradeoff analysis are presented.<>
Keywords :
fault tolerant computing; multiprocessing systems; Dragon; cache-aided rollback error recovery; cache-coherence protocols; shared-memory multiprocessor systems; soft error management operations; tradeoff analysis; Algorithm design and analysis; Cache memory; Cache storage; Checkpointing; Hardware; Multiprocessing systems; Programming profession; Protocols; Tires; Transient analysis;
Conference_Titel :
Fault-Tolerant Computing, 1990. FTCS-20. Digest of Papers., 20th International Symposium
Conference_Location :
Newcastle Upon Tyne, UK
Print_ISBN :
0-8186-2051-X
DOI :
10.1109/FTCS.1990.89338