• DocumentCode
    2888264
  • Title

    Parallel compression of correlated files

  • Author

    Meiri, Ehud ; Barak, Amnon

  • Author_Institution
    Dept. of Comput. Sci., Hebrew Univ. of Jerusalem, Jerusalem
  • fYear
    2007
  • fDate
    17-20 Sept. 2007
  • Firstpage
    285
  • Lastpage
    292
  • Abstract
    Economy-based admission control of jobs in a grid, or migration of guest jobs from a disconnecting cluster in a grid, as well as checkpointing parallel jobs in a cluster to a central repository are demanding tasks that can exhaust essential resources such as the communication networks, due to the requirement to quickly move large amounts of data from many nodes. Compressing memory images might make these operations more efficient provided that the overall throughput is increased. Existing serial compression algorithms are not suitable for such purposes because they do not exploit inter-file redundancy. This paper presents decentralized algorithms for parallel compression of correlated memory images of a job in a cluster or in a grid. The algorithms use block suppression to eliminate inter-file redundancy. They take advantage of the multiple processor environment to simultaneously map memory blocks to hash values in order to detect redundancies. It is shown that exploiting inter-file redundancy of correlated files can increase the overall transfer throughput of parallel jobs. It is also shown that combining serial compression with our algorithms further increases this throughput. The paper presents the algorithms and their performance.
  • Keywords
    checkpointing; cryptography; data compression; file organisation; grid computing; parallel algorithms; block suppression; checkpointing; correlated file parallel compression; correlated memory image compression; decentralized algorithm; economy-based admission control; grid computing; hash value; inter-file redundancy elimination; Admission control; Checkpointing; Clustering algorithms; Communication networks; Compression algorithms; Computer science; Image coding; Image storage; Partitioning algorithms; Throughput;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing, 2007 IEEE International Conference on
  • Conference_Location
    Austin, TX
  • ISSN
    1552-5244
  • Print_ISBN
    978-1-4244-1387-4
  • Electronic_ISBN
    1552-5244
  • Type

    conf

  • DOI
    10.1109/CLUSTR.2007.4629242
  • Filename
    4629242