Combining virtual machine migration with process migration for HPC on multi-clusters and grids

Tal Maoz*, Amnon Barak, Lior Amar

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-reviewed

21 Scopus citations

Abstract

The renewed interest in virtualization gives rise to new opportunities for running High Performance Computing (HPC) applications on clusters and Grids. These include the ability to create a uniform (virtual) run-time environment on top of a multitude of hardware and software platforms, and the possibility for dynamic resource allocation towards the improvement of process performance, e.g., by Virtual Machine (VM) migration as a means for load-balancing. This paper deals with issues related to running HPC applications on multi-clusters and Grids using VMware, a virtualization package running on Windows, Linux and OS X. The paper presents the "Jobrun" system for transparent, on-demand VM launching upon job submission, and its integration with the MOSIX cluster and Grid management system. We present a novel approach to job migration, combining VM migration with process migration using Jobrun, by which it is possible to migrate groups of processes and parallel jobs among different clusters in a multi-cluster or in a Grid. We use four real HPC applications to evaluate the overheads of VMware (both on Linux and Windows), the MOSIX cluster extensions and their combination, and present detailed measurements of the performance of Jobrun.

Original language: English
Title of host publication: Proceedings of the 2008 IEEE International Conference on Cluster Computing, CCGRID 2008
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 89-98
Number of pages: 10
ISBN (Print): 9781424426409
State: Published - 2008
Event: 2008 IEEE International Conference on Cluster Computing, ICCC 2008 - Tsukuba, Japan
Duration: 29 Sep 2008 – 1 Oct 2008

Publication series

Name: Proceedings - IEEE International Conference on Cluster Computing, ICCC
Volume: Proceedings of the 2008 IEEE International Conference on Clus...
ISSN (Print): 1552-5244

Conference

Conference: 2008 IEEE International Conference on Cluster Computing, ICCC 2008
Country/Territory: Japan
City: Tsukuba
Period: 29/09/08 – 1/10/08
