DocumentCode
2049890
Title
AA-Dedupe: An Application-Aware Source Deduplication Approach for Cloud Backup Services in the Personal Computing Environment
Author
Fu, Yinjin ; Jian, Hong ; Xiao, Nong ; Tian, Lei ; Liu, Fang
Author_Institution
Sch. of Comput., Nat. Univ. of Defense Technol., Changsha, China
fYear
2011
fDate
26-30 Sept. 2011
Firstpage
112
Lastpage
120
Abstract
The market for cloud backup services in the personal computing environment is growing due to large volumes of valuable personal and corporate data being stored on desktops, laptops and smart phones. Source deduplication has become a mainstay of cloud backup that saves network bandwidth and reduces storage space. However, there are two challenges facing deduplication for cloud backup service clients: (1) low deduplication efficiency due to a combination of the resource-intensive nature of deduplication and the limited system resources on the PC-based client site, and (2) low data transfer efficiency since post-deduplication data transfers from source to backup servers are typically very small but must often cross a WAN. In this paper, we present AA-Dedupe, an application-aware source deduplication scheme, to significantly reduce the computational overhead, increase the deduplication throughput and improve the data transfer efficiency. The AA-Dedupe approach is motivated by our key observations of the substantial differences among applications in data redundancy and deduplication characteristics, and thus is based on an application-aware index structure that effectively exploits this application awareness. Our experimental evaluations, based on an AA-Dedupe prototype implementation, show that our scheme can improve deduplication efficiency over the state-of-art source-deduplication methods by a factor of 2-7, resulting in shortened backup window, increased power-efficiency and reduced cost for cloud backup services.
Keywords
back-up procedures; bandwidth allocation; cloud computing; personal computing; wide area networks; AA-Dedupe; WAN; application-aware index structure; application-aware source deduplication approach; cloud backup services; data redundancy; data transfer efficiency; network bandwidth saving; personal computing environment; storage space reduction; Bandwidth; Cloud computing; Fingerprint recognition; Indexes; Redundancy; Throughput; Wide area networks; Cloud backup; application-aware index; deduplication efficiency; intelligent data chunking; source deduplication;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing (CLUSTER), 2011 IEEE International Conference on
Conference_Location
Austin, TX
Print_ISBN
978-1-4577-1355-2
Electronic_ISBN
978-0-7695-4516-5
Type
conf
DOI
10.1109/CLUSTER.2011.20
Filename
6061046
Link To Document