Title :
E-DAID: An Efficient Distributed Architecture for In-Line Data De-duplication
Author :
Sengar, Seetendra Singh ; Mishra, Manoj
Author_Institution :
Electron. & Comput. Eng. Dept., Indian Inst. of Technol. Roorkee, Roorkee, India
Abstract :
As data grow rapidly in data centers, data de-duplication, a form of compression, has become an important requirement for most commercial and research backup systems. Currently, data de-duplication storage systems continually face challenges in providing the throughput and capacity needed to move backup data within backup and recovery window times. In this paper, we present a distributed architecture for in-line data de-duplication with one node designated as the server and multiple storage nodes. In the proposed architecture, we use an Intelligent Storage Balancing Strategy to distribute the data among the storage nodes and improve de-duplication efficiency. All the nodes, including the server, can perform block-level de-duplication in parallel. The proposed architecture de-duplicates with high throughput and supports a de-duplication ratio comparable to that of a single system. In the last section of the paper, we introduce a technique called Sampled Hashing to improve the scalability of the architecture.
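For readers unfamiliar with block-level de-duplication, the following is a minimal, generic sketch of the idea on a single node: data is split into blocks, each block is identified by a hash signature, and only blocks with unseen signatures are stored. It is not the E-DAID implementation. The fixed 4096-byte block size, the SHA-256 signature, and the `BlockStore` class with its methods are all assumptions made for illustration; the abstract does not specify any of them, and the Intelligent Storage Balancing Strategy and Sampled Hashing are not modeled here.

```python
import hashlib

BLOCK_SIZE = 4096  # assumed fixed block size; the abstract does not specify one


class BlockStore:
    """Hypothetical in-memory store that keeps one copy of each unique block."""

    def __init__(self):
        self.blocks = {}        # hash signature -> block bytes
        self.logical_bytes = 0  # total bytes written before de-duplication

    def write(self, data: bytes) -> list:
        """Split data into blocks, store only unseen ones, return the recipe."""
        recipe = []
        for offset in range(0, len(data), BLOCK_SIZE):
            block = data[offset:offset + BLOCK_SIZE]
            # SHA-256 is an assumed choice of hash signature.
            signature = hashlib.sha256(block).hexdigest()
            if signature not in self.blocks:
                self.blocks[signature] = block
            recipe.append(signature)
            self.logical_bytes += len(block)
        return recipe

    def read(self, recipe) -> bytes:
        """Rebuild the original data from its list of signatures."""
        return b"".join(self.blocks[s] for s in recipe)

    def dedup_ratio(self) -> float:
        """Logical bytes written divided by bytes physically stored."""
        stored = sum(len(b) for b in self.blocks.values())
        return self.logical_bytes / stored if stored else 1.0


store = BlockStore()
data = b"A" * BLOCK_SIZE * 3 + b"B" * BLOCK_SIZE
recipe = store.write(data)
```

In this toy run, four logical blocks are written but only two distinct blocks are kept, and `store.read(recipe)` reconstructs the original bytes. In a distributed setting each storage node would hold its own index of this kind, which is why the way data is distributed among nodes affects the overall de-duplication ratio.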
Keywords :
client-server systems; data compression; parallel architectures; resource allocation; storage management; E-DAID; backup data; block level de-duplication; data centers; data compression; data distribution; de-duplication ratio; distributed architecture scalability; in-line data de-duplication storage system; intelligent storage balancing strategy; multiple storage nodes; recovery window times; sampled hashing; Computer architecture; Conferences; Distributed databases; Indexes; Scalability; Servers; Throughput; data de-duplication; hash signature; in-line de-duplication; load sharing;
Conference_Title :
Communication Systems and Network Technologies (CSNT), 2012 International Conference on
Conference_Location :
Rajkot
Print_ISBN :
978-1-4673-1538-8
DOI :
10.1109/CSNT.2012.101