• DocumentCode
    659427
  • Title

    DDSN: Duplicate detection to reduce both storage and bandwidth consumption

  • Author

    Jiaran Zhang ; Xiaohui Yu ; Yang Liu ; Liwei Lin

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
  • fYear
    2013
  • fDate
    6-9 Oct. 2013
  • Firstpage
    206
  • Lastpage
    211
  • Abstract
    As highly centralized storage facilities gain popularity, duplicate detection becomes a critical problem. Traditional methods focus on reducing storage space consumption; however, for network storage systems with remote clients, the network overhead cannot be ignored, especially when the system is accessed over a WAN. We propose a new duplicate detection method and implement a network file system prototype, called DDSN, based on this method. DDSN achieves the same storage space consumption as the state-of-the-art sliding blocking method. At the same time, it overcomes that method's drawback of requiring the whole file to be transmitted over the network, and therefore saves a large amount of bandwidth for duplicate data. Experiments confirm the effectiveness of the proposed method.
  • Keywords
    storage area networks; wide area networks; DDSN; WAN; bandwidth consumption; duplicate data; duplicate detection method; massive bandwidth; network file system prototype; network overhead; network storage system; reduce both storage; remote clients; sliding blocking method; storage facilities; storage space consumption; Bandwidth; Electronic mail; Hard disks; Indexes; Prototypes; Servers; Wide area networks; Duplicate Detection; Network File System
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Big Data, 2013 IEEE International Conference on
  • Conference_Location
    Silicon Valley, CA
  • Type

    conf

  • DOI
    10.1109/BigData.2013.6691576
  • Filename
    6691576