DocumentCode :
3862970
Title :
LERD, a locality enhanced and resemblance based deduplication scheme for large data sets
Author :
Panfeng Zhang;Ke Zhou;Hua Wang
Author_Institution :
School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, China
fYear :
2015
Firstpage :
1
Lastpage :
4
Abstract :
As one kind of storage technology, deduplicaition is widely deployed in all kinds of storage systems. However, the key problems of duplication, such as data throughput and usage of RAM, have not been perfectly addressed. Especially, with the emergence of cloud storage, traditional deduplication methods are not able to adapt to the velocity characteristic of the large data sets. This paper proposes LERD, a temporal locality enhanced resemblance based Duplication scheme, aiming at rapidly querying duplicated data for large scale data sets. LERD takes advantage of data resemblance and temporal locality of data stream to narrow query range, which not only rise throughput, but also decline usage of RAM. Theoretical analysis and experimental results show that LERD´s performance is much better than other state-of-the-art schemes.
Keywords :
"Throughput","Fingerprint recognition","Indexes","Random access memory","Radio frequency","Bars","Electronic mail"
Publisher :
ieee
Conference_Titel :
Signal Processing, Communications and Computing (ICSPCC), 2015 IEEE International Conference on
Print_ISBN :
978-1-4799-8918-8
Type :
conf
DOI :
10.1109/ICSPCC.2015.7338962
Filename :
7338962
Link To Document :
بازگشت