DocumentCode :
1864021
Title :
Low Overhead Incremental Checkpointing and Rollback Recovery Scheme on Windows Operating System
Author :
Chen, Chih-Ho ; Ting, Yung ; Heh, Jia-Sheng
Author_Institution :
Dept. of Mech. Eng., Chung Yuan Christian Univ., Chungli, Taiwan
fYear :
2010
fDate :
9-10 Jan. 2010
Firstpage :
268
Lastpage :
271
Abstract :
This article addresses the implementation of a low-overhead incremental checkpointing and rollback recovery scheme that combines incremental checkpointing with a copy-on-write technique and an optimal checkpointing interval. Checkpointing saves the process state periodically during failure-free execution, and the recovery scheme allows the task to continue executing normally when a failure occurs in a PC-based computer-controlled system running the Windows operating system. Capturing an excessively large state or checkpointing at arbitrary intervals results in either performance degradation or high recovery cost. To minimize overhead, the checkpointing and recovery scheme is built on Win32 API interception together with incremental checkpointing and the copy-on-write technique. Instead of saving the entire process space, it saves only the modified pages and uses a buffer to hold state temporarily during checkpointing, which reduces the checkpointing overhead. For a system subject to failures, the optimal checkpointing interval is found by using probability to calculate the minimum expected total overhead time to complete a task. Simulation results show that the proposed checkpointing and rollback recovery scheme not only improves the ability to keep the task executing normally but also reduces the overhead of checkpointing and recovery.
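The idea of saving only modified pages can be illustrated with a small Python sketch. It is a toy model under stated assumptions, not the authors' implementation: the class name `IncrementalCheckpointer`, the 4096-byte page size, and the in-memory list of checkpoints are all hypothetical, and the sketch models neither Win32 API interception nor copy-on-write page protection. It only shows how tracking dirty pages keeps each checkpoint small and how rollback replays the saved pages.

```python
PAGE_SIZE = 4096  # assumed page size in bytes, chosen for illustration only


class IncrementalCheckpointer:
    """Toy model of incremental checkpointing over a flat byte buffer."""

    def __init__(self, num_pages):
        self.memory = bytearray(num_pages * PAGE_SIZE)
        self.dirty = set()  # indices of pages written since the last checkpoint
        # The first checkpoint is a full snapshot so rollback has a base image.
        self.checkpoints = [{p: self._page(p) for p in range(num_pages)}]

    def _page(self, p):
        # Return an immutable copy of page p.
        return bytes(self.memory[p * PAGE_SIZE:(p + 1) * PAGE_SIZE])

    def write(self, offset, data):
        # Apply a write and mark every page it touches as dirty.
        if not data:
            return
        if offset < 0 or offset + len(data) > len(self.memory):
            raise ValueError("write outside the modeled address space")
        self.memory[offset:offset + len(data)] = data
        first = offset // PAGE_SIZE
        last = (offset + len(data) - 1) // PAGE_SIZE
        self.dirty.update(range(first, last + 1))

    def checkpoint(self):
        # Save only the pages modified since the previous checkpoint.
        delta = {p: self._page(p) for p in sorted(self.dirty)}
        self.checkpoints.append(delta)
        self.dirty.clear()
        return len(delta)  # number of pages saved

    def rollback(self):
        # Rebuild the last checkpointed state by replaying the base
        # snapshot and then every incremental delta, oldest first.
        for delta in self.checkpoints:
            for p, content in delta.items():
                self.memory[p * PAGE_SIZE:(p + 1) * PAGE_SIZE] = content
        self.dirty.clear()
```

In this model, a write that touches a single page makes the next `checkpoint()` call save one page rather than the whole buffer, and `rollback()` discards any writes made after the most recent checkpoint.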
Keywords :
checkpointing; operating systems (computers); PC-based computer-controlled system; Win32 API interception; Windows Operating System; Windows operating system; checkpointing overhead; copy-on-write technique; failure free execution; low overhead incremental checkpointing; optimal checkpointing interval; performance degradation; rollback recovery scheme; Application software; Checkpointing; Computer errors; Data engineering; Data mining; Fault detection; Hardware; Knowledge engineering; Mechanical engineering; Operating systems; copy-on-write; incremental checkpointing; optimal interval; recovery;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Knowledge Discovery and Data Mining, 2010. WKDD '10. Third International Conference on
Conference_Location :
Phuket
Print_ISBN :
978-1-4244-5397-9
Electronic_ISBN :
978-1-4244-5398-6
Type :
conf
DOI :
10.1109/WKDD.2010.135
Filename :
5432637