DocumentCode
2888264
Title
Parallel compression of correlated files
Author
Meiri, Ehud ; Barak, Amnon
Author_Institution
Dept. of Comput. Sci., Hebrew Univ. of Jerusalem, Jerusalem
fYear
2007
fDate
17-20 Sept. 2007
Firstpage
285
Lastpage
292
Abstract
Economy-based admission control of jobs in a grid, or migration of guest jobs from a disconnecting cluster in a grid, as well as checkpointing parallel jobs in a cluster to a central repository are demanding tasks that can exhaust essential resources such as the communication networks, due to the requirement to quickly move large amounts of data from many nodes. Compressing memory images might make these operations more efficient provided that the overall throughput is increased. Existing serial compression algorithms are not suitable for such purposes because they do not exploit inter-file redundancy. This paper presents decentralized algorithms for parallel compression of correlated memory images of a job in a cluster or in a grid. The algorithms use block suppression to eliminate inter-file redundancy. They take advantage of the multiple processor environment to simultaneously map memory blocks to hash values in order to detect redundancies. It is shown that exploiting inter-file redundancy of correlated files can increase the overall transfer throughput of parallel jobs. It is also shown that combining serial compression with our algorithms further increases this throughput. The paper presents the algorithms and their performance.
Keywords
checkpointing; cryptography; data compression; file organisation; grid computing; parallel algorithms; block suppression; checkpointing; correlated file parallel compression; correlated memory image compression; decentralized algorithm; economy-based admission control; grid computing; hash value; inter-file redundancy elimination; Admission control; Checkpointing; Clustering algorithms; Communication networks; Compression algorithms; Computer science; Image coding; Image storage; Partitioning algorithms; Throughput;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing, 2007 IEEE International Conference on
Conference_Location
Austin, TX
ISSN
1552-5244
Print_ISBN
978-1-4244-1387-4
Electronic_ISBN
1552-5244
Type
conf
DOI
10.1109/CLUSTR.2007.4629242
Filename
4629242
Link To Document