Title : 
Optimizing Live Migration of Virtual Machines in SMP Clusters for HPC Applications
         
        
            Author : 
Atif, Muhammad ; Strazdins, Peter
         
        
            Author_Institution : 
Dept. of Comput. Sci., Australian Nat. Univ., Canberra, ACT, Australia
         
        
        
        
        
        
            Abstract : 
Live migration is one of the most useful features provided by today's virtual machine monitors (VMMs). It enables seamless hardware upgrades, provides fault tolerance, achieves load balancing and saves power through server consolidation. These features can also be beneficial in HPC environments. This paper presents a comprehensive study of the migration facility of the Xen VMM, specifically targeting HPC applications. We analyze the effects of live and non-live migration techniques on HPC application wall times. We present in detail how the migration routine relates to memory modification, communication intensity and CPU contention between guest VMs and the host VMM. We propose a simple optimization for the live migration feature. Our optimization is able to reduce the total number of memory pages transferred during the migration by up to 500%, and results show an average of 50% improvement over the default Xen migration routine on traditional gigabit Ethernet infrastructure. We also demonstrate that live migration of virtual machines in an HPC environment can be used to improve application wall times.
         
        
            Keywords : 
fault tolerant computing; multiprocessing systems; operating systems (computers); parallel processing; resource allocation; virtual machines; CPU contention; HPC application; SMP cluster; Xen VMM; communication intensity; fault tolerance; gigabit Ethernet infrastructure; hardware upgrade; live migration; load balancing; memory modification; nonlive migration; operating system; optimization; power saving; server consolidation; virtual machine monitor; Application software; Computer science; Fault tolerance; Hardware; Load management; Parallel processing; Processor scheduling; Virtual machine monitors; Virtual machining; Voice mail;
         
        
        
        
            Conference_Title : 
Network and Parallel Computing, 2009. NPC '09. Sixth IFIP International Conference on
         
        
            Conference_Location : 
Gold Coast, QLD
         
        
            Print_ISBN : 
978-1-4244-4990-3
         
        
            Electronic_ISBN : 
978-0-7695-3837-2
         
        
        
            DOI : 
10.1109/NPC.2009.32