Title :
LERD, a locality enhanced and resemblance based deduplication scheme for large data sets
Author :
Panfeng Zhang;Ke Zhou;Hua Wang
Author_Institution :
School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, China
Abstract :
As one kind of storage technology, deduplicaition is widely deployed in all kinds of storage systems. However, the key problems of duplication, such as data throughput and usage of RAM, have not been perfectly addressed. Especially, with the emergence of cloud storage, traditional deduplication methods are not able to adapt to the velocity characteristic of the large data sets. This paper proposes LERD, a temporal locality enhanced resemblance based Duplication scheme, aiming at rapidly querying duplicated data for large scale data sets. LERD takes advantage of data resemblance and temporal locality of data stream to narrow query range, which not only rise throughput, but also decline usage of RAM. Theoretical analysis and experimental results show that LERD´s performance is much better than other state-of-the-art schemes.
Keywords :
"Throughput","Fingerprint recognition","Indexes","Random access memory","Radio frequency","Bars","Electronic mail"
Conference_Titel :
Signal Processing, Communications and Computing (ICSPCC), 2015 IEEE International Conference on
Print_ISBN :
978-1-4799-8918-8
DOI :
10.1109/ICSPCC.2015.7338962