• DocumentCode
    3536888
  • Title

    A Parallel Architecture for In-Line Data De-duplication

  • Author

    Sengar, Seetendra Singh ; Mishra, Manoj

  • Author_Institution
    Dept. of Electron. & Comput. Eng., Indian Inst. of Technol., Roorkee, India
  • fYear
    2012
  • fDate
    7-8 Jan. 2012
  • Firstpage
    399
  • Lastpage
    403
  • Abstract
    Recently, data de-duplication, the hot emerging technology, has received a broad attention from both academia and industry. Some researches focus on the approach by which more redundant data can be reduced and others investigate how to do data de-duplication at high speed. In this paper, we show the importance of data de-duplication in the current digital world and aim at reducing the time and space requirement for data de-duplication. Then, we present a parallel architecture with one node designated as a server and multiple storage nodes. All the nodes, including the server, can do block level in-line de-duplication in parallel. We have built a prototype of the system and present some performance results. The proposed system uses magnetic disks as a storage technology.
  • Keywords
    data compression; parallel architectures; in-line data de-duplication; parallel architecture; redundant data; Computer architecture; Databases; Electronic mail; Industries; Java; Redundancy; Servers; cluster; data de-duplication; hash signature; in-line de-duplication; load sharing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Computing & Communication Technologies (ACCT), 2012 Second International Conference on
  • Conference_Location
    Rohtak, Haryana
  • Print_ISBN
    978-1-4673-0471-9
  • Type

    conf

  • DOI
    10.1109/ACCT.2012.10
  • Filename
    6168400