• DocumentCode
    3673770
  • Title

    Improving storage capacity by distributed exact deduplication systems

  • Author

    Cristian Barca;Dan Claudiu Barca;Constantin Mara;Petre Anghelescu;Bogdan Gavriloaia;Radu Vizireanu;Razvan Craciunescu;Octavian Fratu

  • Author_Institution
    Faculty of Electronics, Communications and Computers, University of Pitesti, Roumania
  • fYear
    2015
  • fDate
    6/1/2015 12:00:00 AM
  • Abstract
    The topic of data deduplication has received lately a lot of attention for its storage reduction functionality. Data deduplication essentially refers to the elimination of redundant data, leaving only one copy of the data to be stored, and is meant to reduce the pain regarding the exponential data growth in backup or archiving centers. Most existing state-of-the-art deduplication systems rely on approximate deduplication in order to achieve high-performance. Unfortunately, these studies are usually conducted and tested on single-host systems. Although their authors claim that the design can be easily applied on multi-node systems, we have not seen yet an extension that enacts that - they lack of trust. Thus, in a world where data deduplication storage systems are continuously struggling in providing the required throughput and disk capacities necessary to store and retrieve data within reasonable times, we are handled the task to design a distributed deduplication systems that will achieve efficiency, scalability and throughput at a petascale capacity level. In this paper we present a proof-of-concept design that one can use to implement such a system: A Distributed Exact Deduplication System, which we believe it will cross the boundaries towards a new generation of backup and archiving systems.
  • Keywords
    "Fingerprint recognition","Containers","Metadata","Indexes","Throughput","Random access memory","Scalability"
  • Publisher
    ieee
  • Conference_Titel
    Electronics, Computers and Artificial Intelligence (ECAI), 2015 7th International Conference on
  • Print_ISBN
    978-1-4673-6646-5
  • Type

    conf

  • DOI
    10.1109/ECAI.2015.7301141
  • Filename
    7301141