TY - GEN
T1 - Combining virtual machine migration with process migration for HPC on multi-clusters and grids
AU - Maoz, Tal
AU - Barak, Amnon
AU - Amar, Lior
PY - 2008
Y1 - 2008
AB - The renewed interest in virtualization gives rise to new opportunities for running High Performance Computing (HPC) applications on clusters and Grids. These include the ability to create a uniform (virtual) run-time environment on top of a multitude of hardware and software platforms, and the possibility for dynamic resource allocation towards the improvement of process performance, e.g., by Virtual Machine (VM) migration as a means for load-balancing. This paper deals with issues related to running HPC applications on multi-clusters and Grids using VMware, a virtualization package running on Windows, Linux and OS X. The paper presents the "Jobrun" system for transparent, on-demand VM launching upon job submission, and its integration with the MOSIX cluster and Grid management system. We present a novel approach to job migration, combining VM migration with process migration using Jobrun, by which it is possible to migrate groups of processes and parallel jobs among different clusters in a multi-cluster or in a Grid. We use four real HPC applications to evaluate the overheads of VMware (both on Linux and Windows), the MOSIX cluster extensions and their combination, and present detailed measurements of the performance of Jobrun.
UR - http://www.scopus.com/inward/record.url?scp=57949111058&partnerID=8YFLogxK
U2 - 10.1109/CLUSTR.2008.4663759
DO - 10.1109/CLUSTR.2008.4663759
M3 - Conference contribution
AN - SCOPUS:57949111058
SN - 9781424426409
T3 - Proceedings - IEEE International Conference on Cluster Computing, ICCC
SP - 89
EP - 98
BT - Proceedings of the 2008 IEEE International Conference on Cluster Computing, ICCC 2008
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2008 IEEE International Conference on Cluster Computing, ICCC 2008
Y2 - 29 September 2008 through 1 October 2008
ER -