• DocumentCode
    1793522
  • Title

    Accelerating duplicate data chunk recognition using NN trained by locality-sensitive hash

  • Author

    Berman, Amit ; Birk, Yitzhak ; Mendelson, Avi

  • Author_Institution
    Electr. Eng. Dept., Technion - Israel Inst. of Technol., Haifa, Israel
  • fYear
    2014
  • fDate
    3-5 Dec. 2014
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Deduplication is often used in storage systems in order to save storage space, communication bandwidth, write energy, and recovery and error-protection infrastructure. However, deduplication overhead increases latency and computation energy. Determining whether a data chunk is already stored by comparing signatures constitutes a significant fraction of this deduplication overhead. In this paper, we propose a statistical chunk classifier based on a neural network. Our technique is based on learning the patterns of locality-sensitive hashing of the data. Our experiments show an acceleration of chunk processing, leading to reduction in deduplication overhead.
  • Keywords
    file organisation; neural nets; pattern classification; NN; communication bandwidth; computation energy; deduplication overhead; duplicate data chunk recognition; error-protection infrastructure; locality-sensitive hash; locality-sensitive hashing; neural network; statistical chunk classifier; storage systems; write energy; Acceleration; Artificial neural networks; Biological neural networks; Computer architecture; Neurons; Training; Chunking; Cloud Storage; Deduplication; Locality-Sensitive Hashing; Machine Learning; Neural Network;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical & Electronics Engineers in Israel (IEEEI), 2014 IEEE 28th Convention of
  • Conference_Location
    Eilat
  • Print_ISBN
    978-1-4799-5987-7
  • Type

    conf

  • DOI
    10.1109/EEEI.2014.7005887
  • Filename
    7005887