Title :
Early comparison and decision strategies for datapaths that recover from transient faults
Author :
Narasimhan, Ragini ; Rosenkrantz, Daniel J. ; Ravi, S.S.
Author_Institution :
Dept. of Comput. Sci., State Univ. of New York, Albany, NY, USA
fDate :
5/1/1997 12:00:00 AM
Abstract :
Rollback and retry is a common approach used to achieve error recovery in datapaths that tolerate transient faults. In this approach, each segment of a computation is duplicated and the results are compared using fault-tolerant comparators. If the compared values are unequal, the segment is rolled back to the preceding correct state (rollback point) and retried from that state. We introduce early comparison and rollback strategies for use in such datapaths. These strategies utilize comparators during the computational portion of the segment and can initiate a rollback before the segment is completed. We illustrate through examples how these strategies can reduce hardware costs (number of comparators needed) and the delay in recovering from a transient fault compared to conventional strategies
Keywords :
application specific integrated circuits; comparators (circuits); delays; fault tolerant computing; high level synthesis; integrated circuit design; integrated circuit reliability; system recovery; transients; ASIC; datapaths; decision strategies; delay reduction; early comparison strategies; error recovery; fault-tolerant comparators; rollback strategies; transient faults; Application specific integrated circuits; Circuit faults; Costs; Data flow computing; Delay; Digital signal processing; Fault tolerance; Hardware; High level synthesis; Processor scheduling;
Journal_Title :
Circuits and Systems I: Fundamental Theory and Applications, IEEE Transactions on