Title :
Phase-based reboot: Reusing operating system execution phases for cheap reboot-based recovery
Author :
Yamakita, Kazuya ; Yamada, Hiroshi ; Kono, Kenji
Author_Institution :
Keio Univ., Yokohama, Japan
Abstract :
Although operating systems (OSes) are crucial to achieving high availability of computer systems, modern OSes are far from bug-free. Rebooting the OS is simple, powerful, and sometimes the only remedy for kernel failures. Once we accept reboot-based recovery as a fact of life, we should try to ensure that the downtime caused by reboots is as short as possible. This paper presents “phase-based” reboots that shorten the downtime caused by reboot-based recovery. The key idea is to divide a boot sequence into phases. The phase-based reboot reuses a system state in the previous boot if the next boot reproduces the same state. A prototype of the phase-based reboot was implemented on Xen 3.4.1 running para-virtualized Linux 2.6.18. Experiments with the prototype show that it successfully recovered from kernel transient failures inserted by a fault injector, and its downtime was 34.3 to 93.6% shorter than that of the normal reboot-based recovery.
Keywords :
operating systems (computers); software reusability; Xen 3.4.1; cheap reboot-based recovery; operating system reuse; para-virtualized Linux 2.6.18; phase-based reboot; Hardware; Image restoration; Kernel; Linux; Random access memory; Transient analysis; Operating System Reliability; Reboot-based Recovery; Virtualization;
Conference_Titel :
Dependable Systems & Networks (DSN), 2011 IEEE/IFIP 41st International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-9232-9
Electronic_ISBN :
1530-0889
DOI :
10.1109/DSN.2011.5958216