Title :
Combining Virtual Machine migration with process migration for HPC on multi-clusters and Grids
Author :
Maoz, Tal ; Barak, Amnon ; Amar, Lior
Author_Institution :
Dept. of Comput. Sci., Hebrew Univ. of Jerusalem, Jerusalem
Date :
Sept. 29 2008-Oct. 1 2008
Abstract :
The renewed interest in virtualization gives rise to new opportunities for running high performance computing (HPC) applications on clusters and grids. These include the ability to create a uniform (virtual) run-time environment on top of a multitude of hardware and software platforms, and the possibility for dynamic resource allocation towards the improvement of process performance, e.g., by virtual machine (VM) migration as a means for load-balancing. This paper deals with issues related to running HPC applications on multi-clusters and grids using VMware, a virtualization package running on Windows, Linux and OS X. The paper presents the "Jobrun" system for transparent, on-demand VM launching upon job submission, and its integration with the MOSIX cluster and grid management system. We present a novel approach to job migration, combining VM migration with process migration using Jobrun, by which it is possible to migrate groups of processes and parallel jobs among different clusters in a multi-cluster or in a grid. We use four real HPC applications to evaluate the overheads of VMware (both on Linux and Windows), the MOSIX cluster extensions and their combination, and present detailed measurements of the performance of Jobrun.
Keywords :
grid computing; parallel processing; pattern clustering; virtual machines; Jobrun; Linux; Windows; dynamic resource allocation; hardware-software platforms; high performance computing; load-balancing; process migration; run-time environments; virtual machine migration; virtualization package; Application software; Application virtualization; Hardware; High performance computing; Linux; Resource management; Runtime environment; Software performance; Virtual machining; Virtual manufacturing;
Conference_Title :
Cluster Computing, 2008 IEEE International Conference on
Conference_Location :
Tsukuba
Print_ISBN :
978-1-4244-2639-3
Electronic_ISBN :
1552-5244
DOI :
10.1109/CLUSTR.2008.4663759