Title :
Design and implementation of various file deduplication schemes on storage devices
Author :
Yong-Ting Wu; Min-Chieh Yu; Jenq-Shiou Leu; Eau-Chung Lee; Tian Song
Author_Institution :
Department of Electronic and Computer Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan
Abstract :
As the smart devices revolutionize, people may generate a lot of data and store the data in the local or remote file system in their daily lives. Even though the novel computer hardware and network technologies can handle the demand of generating a big volume of data, effective file deduplication can save storage space in either the private computing environment or the public cloud system. In the paper, we aim at designing and implementing various file deduplication schemes on storage device, which are based on different duplication checking rules, including file name, file size, and file full/partial content hash value. Comprehensive experiment results show that a partial content hashing based file deduplication can have a better trade-off between the computation cost and deduplication accuracy.
Keywords :
"Data structures","Accuracy","Time complexity","Computers","Cloud computing"
Conference_Titel :
Heterogeneous Networking for Quality, Reliability, Security and Robustness (QSHINE), 2015 11th International Conference on