• DocumentCode
    2049890
  • Title

    AA-Dedupe: An Application-Aware Source Deduplication Approach for Cloud Backup Services in the Personal Computing Environment

  • Author

    Fu, Yinjin ; Jiang, Hong ; Xiao, Nong ; Tian, Lei ; Liu, Fang

  • Author_Institution
    Sch. of Comput., Nat. Univ. of Defense Technol., Changsha, China
  • fYear
    2011
  • fDate
    26-30 Sept. 2011
  • Firstpage
    112
  • Lastpage
    120
  • Abstract
    The market for cloud backup services in the personal computing environment is growing due to large volumes of valuable personal and corporate data being stored on desktops, laptops and smart phones. Source deduplication has become a mainstay of cloud backup that saves network bandwidth and reduces storage space. However, there are two challenges facing deduplication for cloud backup service clients: (1) low deduplication efficiency due to a combination of the resource-intensive nature of deduplication and the limited system resources on the PC-based client site, and (2) low data transfer efficiency since post-deduplication data transfers from source to backup servers are typically very small but must often cross a WAN. In this paper, we present AA-Dedupe, an application-aware source deduplication scheme, to significantly reduce the computational overhead, increase the deduplication throughput and improve the data transfer efficiency. The AA-Dedupe approach is motivated by our key observations of the substantial differences among applications in data redundancy and deduplication characteristics, and thus is based on an application-aware index structure that effectively exploits this application awareness. Our experimental evaluations, based on an AA-Dedupe prototype implementation, show that our scheme can improve deduplication efficiency over the state-of-the-art source-deduplication methods by a factor of 2-7, resulting in a shortened backup window, increased power efficiency and reduced cost for cloud backup services.
  • Keywords
    back-up procedures; bandwidth allocation; cloud computing; personal computing; wide area networks; AA-Dedupe; WAN; application-aware index structure; application-aware source deduplication approach; cloud backup services; data redundancy; data transfer efficiency; network bandwidth saving; personal computing environment; storage space reduction; Bandwidth; Cloud computing; Fingerprint recognition; Indexes; Redundancy; Throughput; Wide area networks; Cloud backup; application-aware index; deduplication efficiency; intelligent data chunking; source deduplication;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Cluster Computing (CLUSTER), 2011 IEEE International Conference on
  • Conference_Location
    Austin, TX
  • Print_ISBN
    978-1-4577-1355-2
  • Electronic_ISBN
    978-0-7695-4516-5
  • Type

    conf

  • DOI
    10.1109/CLUSTER.2011.20
  • Filename
    6061046