DocumentCode :
2537178
Title :
SAM: A Semantic-Aware Multi-tiered Source De-duplication Framework for Cloud Backup
Author :
Tan, Yujuan ; Jiang, Hong ; Feng, Dan ; Tian, Lei ; Yan, Zhichao ; Zhou, Guohui
Author_Institution :
Key Lab. of Data Storage Syst., Huazhong Univ. of Sci. & Technol., Wuhan, China
fYear :
2010
fDate :
13-16 Sept. 2010
Firstpage :
614
Lastpage :
623
Abstract :
Existing de-duplication solutions in cloud backup environment either obtain high compression ratios at the cost of heavy de-duplication overheads in terms of increased latency and reduced throughput, or maintain small de-duplication overheads at the cost of low compression ratios causing high data transmission costs, which results in a large backup window. In this paper, we present SAM, a Semantic-Aware Multitiered source de-duplication framework that first combines the global file-level de-duplication and local chunk-level deduplication, and further exploits file semantics in each stage in the framework, to obtain an optimal tradeoff between the deduplication efficiency and de-duplication overhead and finally achieve a shorter backup window than existing approaches. Our experimental results with real world datasets show that SAM not only has a higher de-duplication efficiency/overhead ratio than existing solutions, but also shortens the backup window by an average of 38.7%.
Keywords :
Internet; client-server systems; data compression; cloud backup environment; compression ratio; global file level deduplication; high data transmission cost; local chunk level deduplication; semantic aware multitiered source deduplication framework; Clouds; Data communication; Image coding; Indexes; Redundancy; Semantics; Servers; Backup Window; Cloud Backup; Data Deduplication; File Semantics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Processing (ICPP), 2010 39th International Conference on
Conference_Location :
San Diego, CA
ISSN :
0190-3918
Print_ISBN :
978-1-4244-7913-9
Electronic_ISBN :
0190-3918
Type :
conf
DOI :
10.1109/ICPP.2010.69
Filename :
5599246
Link To Document :
بازگشت