• DocumentCode
    1474083
  • Title

    Live Virtual Machine Migration via Asynchronous Replication and State Synchronization

  • Author

    Liu, Haikun ; Jin, Hai ; Liao, Xiaofei ; Yu, Chen ; Xu, Cheng-Zhong

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Huazhong Univ. of Sci. & Technol., Wuhan, China
  • Volume
    22
  • Issue
    12
  • fYear
    2011
  • Firstpage
    1986
  • Lastpage
    1999
  • Abstract
    Live migration of virtual machines (VM) across physical hosts provides a significant new benefit for administrators of data centers and clusters. Previous memory-to-memory approaches demonstrate the effectiveness of live VM migration in local area networks (LAN), but they would cause a long period of downtime in a wide area network (WAN) environment. This paper describes the design and implementation of a novel approach, namely, CR/TR-Motion, which adopts checkpointing/recovery and trace/replay technologies to provide fast, transparent VM migration for both LAN and WAN environments. With execution trace logged on the source host, a synchronization algorithm is performed to orchestrate the running source and target VMs until they reach a consistent state. CR/TR-Motion can greatly reduce the migration downtime and network bandwidth consumption. Experimental results show that the approach can drastically reduce migration overheads compared with memory-to-memory approach in a LAN: up to 72.4 percent on application observed downtime, up to 31.5 percent on total migration time, and up to 95.9 percent on the data to synchronize the VM state. The application performance overhead due to migration is kept within 8.54 percent on average. The results also show that for a variety of workloads migrated across WANs, the migration downtime is less than 300 milliseconds.
  • Keywords
    checkpointing; computer centres; local area networks; virtual machines; wide area networks; CR/TR-Motion; asynchronous replication; checkpointing technology; data centers; data clusters; live virtual machine migration; local area networks; migration downtime reduction; network bandwidth consumption reduction; recovery technology; replay technology; state synchronization; trace technology; wide area network; Algorithm design and analysis; Checkpointing; Local area networks; Synchronization; Virtual machining; Wide area networks; Copy-on-write checkpointing; deterministic replay; live migration; trace; virtual machine.;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/TPDS.2011.86
  • Filename
    5733342