DocumentCode
659427
Title
DDSN: Duplicate detection to reduce both storage and bandwidth consumption
Author
Jiaran Zhang ; Xiaohui Yu ; Yang Liu ; Liwei Lin
Author_Institution
Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
fYear
2013
fDate
6-9 Oct. 2013
Firstpage
206
Lastpage
211
Abstract
As highly centralized storage facilities gain popularity, duplicate detection becomes a critical problem. Traditional methods focus on reducing storage space consumption; however, for network storage systems with remote clients, the network overhead cannot be ignored, especially when the system is accessed over a WAN. We propose a new duplicate detection method and implement a network file system prototype, called DDSN, based on it. The method matches the state-of-the-art sliding blocking method in terms of storage space consumption. At the same time, it overcomes the drawback of sliding blocking, which requires the whole file to be transmitted over the network, and therefore saves considerable bandwidth when the data is duplicated. Experiments confirm the effectiveness of the proposed method.
Keywords
storage area networks; wide area networks; DDSN; WAN; bandwidth consumption; duplicate data; duplicate detection method; massive bandwidth; network file system prototype; network overhead; network storage system; reduce both storage; remote clients; sliding blocking method; storage facilities; storage space consumption; Bandwidth; Electronic mail; Hard disks; Indexes; Prototypes; Servers; Wide area networks; Duplicate Detection; Network File System;
fLanguage
English
Publisher
IEEE
Conference_Titel
Big Data, 2013 IEEE International Conference on
Conference_Location
Silicon Valley, CA
Type
conf
DOI
10.1109/BigData.2013.6691576
Filename
6691576