DocumentCode
3032096
Title
An Undirected Graph Traversal Based Grouping Prediction Method for Data De-duplication
Author
Longxiang Wang ; Xingjun Zhang ; Guofeng Zhu ; Yueguang Zhu ; Xiaoshe Dong
Author_Institution
Dept. of Comput. Sci. & Technol., Xi´an Jiaotong Univ., Xi´an, China
fYear
2013
fDate
1-3 July 2013
Firstpage
3
Lastpage
8
Abstract
The data capacity of the de-duplication system, which is limited by the memory, is difficult to carry out a large-scale expansion. To solve this problem, the paper proposes a hash table grouping prediction method based on undirected graph traversal. This method exploits the indexing table replacement, which is similar to the virtual memory cache replacement, to expand the storage capacity of the data de-duplication system without increasing the system memory. The hit rate of the grouping prediction and system performance are improved by grouping index entries based on undirected graph traversal. Experimental results show that, based on the cache prefetching and the hash table grouping, the memory consuming takes up 10% of the index table size while the capacity equally rise to 10 times of original. The method can make the index table cache hit rate increased to 87.6%, comparing 47% without group in dataset 1 of our experiment, make the performance acceptable.
Keywords
data handling; graph theory; cache prefetching; data deduplication system; hash table grouping prediction method; index entry grouping; undirected graph traversal; virtual memory cache replacement; Data structures; Indexes; Integrated circuits; Prediction methods; Prefetching; Sparse matrices; System performance; Data de-duplication; grouping prediction; large-scale storage system;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), 2013 14th ACIS International Conference on
Conference_Location
Honolulu, HI
Type
conf
DOI
10.1109/SNPD.2013.34
Filename
6598437
Link To Document