Title :
Estimating Error-probability and its Application for Optimizing Roll-back Recovery with Checkpointing
Author :
Nikolov, Dimitar ; Ingelsson, Urban ; Singh, Virendra ; Larsson, Erik
Author_Institution :
Dept. of Comput. Sci., Linkoping Univ., Linkoping, Sweden
Abstract :
The probability for errors to occur in electronic systems is not known in advance, but depends on many factors including influence from the environment where the system operates. In this paper, it is demonstrated that inaccurate estimates of the error probability lead to loss of performance in a well known fault tolerance technique, Roll-back Recovery with checkpointing (RRC). To regain the lost performance, a method for estimating the error probability along with an adjustment technique are proposed. Using a simulator tool that has been developed to enable experimentation, the proposed method is evaluated and the results show that the proposed method provides useful estimates of the error probability leading to near-optimal performance of the RRC fault-tolerant technique.
Keywords :
checkpointing; error statistics; fault tolerance; integrated circuit testing; system-on-chip; error probability estimation; fault tolerance; roll-back recovery with checkpointing; system-on-chip; Application software; Checkpointing; Communication system control; Computer errors; Electronic equipment testing; Error correction codes; Error probability; Fault detection; Fault tolerance; Performance loss; error-probability estimation; optimization; roll-back recovery with checkpointing;
Conference_Titel :
Electronic Design, Test and Application, 2010. DELTA '10. Fifth IEEE International Symposium on
Conference_Location :
Ho Chi Minh City
Print_ISBN :
978-0-7695-3978-2
Electronic_ISBN :
978-1-4244-6026-7
DOI :
10.1109/DELTA.2010.25