TY - GEN
T1 - Parallel compression of correlated files
AU - Meiri, Ehud
AU - Barak, Amnon
PY - 2007
Y1 - 2007
N2 - Economy-based admission control of jobs in a grid, or migration of guest jobs from a disconnecting cluster in a grid, as well as checkpointing parallel jobs in a cluster to a central repository are demanding tasks that can exhaust essential resources such as the communication networks, due to the requirement to quickly move large amounts of data from many nodes. Compressing memory images might make these operations more efficient provided that the overall throughput is increased. Existing serial compression algorithms are not suitable for such purposes because they do not exploit inter-file redundancy. This paper presents decentralized algorithms for parallel compression of correlated memory images of a job in a cluster or in a grid. The algorithms use block suppression to eliminate inter-file redundancy. They take advantage of the multiple processor environment to simultaneously map memory blocks to hash values in order to detect redundancies. It is shown that exploiting inter-file redundancy of correlated files can increase the overall transfer throughput of parallel jobs. It is also shown that combining serial compression with our algorithms further increases this throughput. The paper presents the algorithms and their performance.
AB - Economy-based admission control of jobs in a grid, or migration of guest jobs from a disconnecting cluster in a grid, as well as checkpointing parallel jobs in a cluster to a central repository are demanding tasks that can exhaust essential resources such as the communication networks, due to the requirement to quickly move large amounts of data from many nodes. Compressing memory images might make these operations more efficient provided that the overall throughput is increased. Existing serial compression algorithms are not suitable for such purposes because they do not exploit inter-file redundancy. This paper presents decentralized algorithms for parallel compression of correlated memory images of a job in a cluster or in a grid. The algorithms use block suppression to eliminate inter-file redundancy. They take advantage of the multiple processor environment to simultaneously map memory blocks to hash values in order to detect redundancies. It is shown that exploiting inter-file redundancy of correlated files can increase the overall transfer throughput of parallel jobs. It is also shown that combining serial compression with our algorithms further increases this throughput. The paper presents the algorithms and their performance.
UR - http://www.scopus.com/inward/record.url?scp=53349175889&partnerID=8YFLogxK
U2 - 10.1109/CLUSTR.2007.4629242
DO - 10.1109/CLUSTR.2007.4629242
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:53349175889
SN - 1424413885
SN - 9781424413881
T3 - Proceedings - IEEE International Conference on Cluster Computing, ICCC
SP - 285
EP - 292
BT - Proceedings - 2007 IEEE International Conference on Cluster Computing, CLUSTER 2007
T2 - 2007 IEEE International Conference on Cluster Computing, CLUSTER 2007
Y2 - 19 September 2007 through 20 September 2007
ER -