DocumentCode
1474083
Title
Live Virtual Machine Migration via Asynchronous Replication and State Synchronization
Author
Liu, Haikun ; Jin, Hai ; Liao, Xiaofei ; Yu, Chen ; Xu, Cheng-Zhong
Author_Institution
Sch. of Comput. Sci. & Technol., Huazhong Univ. of Sci. & Technol., Wuhan, China
Volume
22
Issue
12
fYear
2011
Firstpage
1986
Lastpage
1999
Abstract
Live migration of virtual machines (VM) across physical hosts provides a significant new benefit for administrators of data centers and clusters. Previous memory-to-memory approaches demonstrate the effectiveness of live VM migration in local area networks (LAN), but they would cause a long period of downtime in a wide area network (WAN) environment. This paper describes the design and implementation of a novel approach, namely, CR/TR-Motion, which adopts checkpointing/recovery and trace/replay technologies to provide fast, transparent VM migration for both LAN and WAN environments. With execution trace logged on the source host, a synchronization algorithm is performed to orchestrate the running source and target VMs until they reach a consistent state. CR/TR-Motion can greatly reduce the migration downtime and network bandwidth consumption. Experimental results show that the approach can drastically reduce migration overheads compared with memory-to-memory approach in a LAN: up to 72.4 percent on application observed downtime, up to 31.5 percent on total migration time, and up to 95.9 percent on the data to synchronize the VM state. The application performance overhead due to migration is kept within 8.54 percent on average. The results also show that for a variety of workloads migrated across WANs, the migration downtime is less than 300 milliseconds.
Keywords
checkpointing; computer centres; local area networks; virtual machines; wide area networks; CR/TR-Motion; asynchronous replication; checkpointing technology; data centers; data clusters; live virtual machine migration; local area networks; migration downtime reduction; network bandwidth consumption reduction; recovery technology; replay technology; state synchronization; trace technology; wide area network; Algorithm design and analysis; Checkpointing; Local area networks; Synchronization; Virtual machining; Wide area networks; Copy-on-write checkpointing; deterministic replay; live migration; trace; virtual machine.;
fLanguage
English
Journal_Title
Parallel and Distributed Systems, IEEE Transactions on
Publisher
ieee
ISSN
1045-9219
Type
jour
DOI
10.1109/TPDS.2011.86
Filename
5733342
Link To Document