DocumentCode :
3756726
Title :
TPLCR: Time-Bound, Pre-copy Live Checkpointing and Parallel Restart of Virtual Machines Using Distributed Memory Servers
Author :
Kasidit Chanchio
Author_Institution :
Dept. of Comput. Sci., Thammasat Univ., Patumtani, Thailand
fYear :
2015
Firstpage :
1
Lastpage :
10
Abstract :
Live checkpointing of virtual machines is the ability to save the state of a virtual machine to storage while the machine is running. This paper presents a novel Time-bound, Pre-Copy Live Checkpointing and parallel Re-start mechanism (TPLCR) that implements live checkpointing based on a time-bounded, pre-copy live migration algorithm. The performance improvements of TPLCR rely on the use of multiple Distributed Memory Servers to allow fast, in-memory checkpointing and parallel restart. Along with the new TPLCR protocol, we introduce the Checkpoint-Restart Service to manage the checkpoint and restart operations in a datacenter. This paper describes a prototype implementation of TPLCR based on KVM. A series of checkpointing experiments were conducted using four CPU and memory intensive Class D NAS Parallel Benchmark kernels. Experimental results show that TPLCR checkpoint-restart performance is significantly better than traditional approaches.
Keywords :
"Checkpointing","Servers","Virtual machine monitors","Computers","Instruction sets","Virtual machining","Random access memory"
Publisher :
ieee
Conference_Titel :
Computing and Networking (CANDAR), 2015 Third International Symposium on
Electronic_ISBN :
2379-1896
Type :
conf
DOI :
10.1109/CANDAR.2015.108
Filename :
7424263
Link To Document :
بازگشت