DocumentCode :
2204868
Title :
Hardware accelerator for similarity based data dedupe
Author :
Dongyang Li ; Qingbo Wang ; Guyot, Cyril ; Narasimha, Ashwin ; Vucinic, Dejan ; Bandic, Zvonimir ; Qing Yang
Author_Institution :
Dept, of Electrical, Computer, and Biomedical Engineering, Univ of Rhode Island, Kingston, 02881, USA
fYear :
2015
fDate :
6-7 Aug. 2015
Firstpage :
224
Lastpage :
232
Abstract :
Data deduplication has proven important in backup storage systems as large amount of identical or similar data chunks exist. Recent studies have shown the great potential of data deduplication in primary storage and storage caches. Deduplications in these environments require high speed processing not to drag down production performance. This paper presents a hardware accelerator for similarity based data deduplication. It implements three compute-intensive kernel modules to improve throughput and latency in dedupe systems: sketch computation for data blocks, index searching for reference block, and delta encoding over similar blocks. Adopting pipelined computation and parallel data lookup across multiple hardware modules, our HW design is capable of processing high throughput data traffic by working on multiple data units concurrently, thus enabling wire speed dedupe for data stream where similar blocks present. Using a PC host system connected to the FPGA-based accelerator through a PCIe Gen 2×4 interface, our experiments show that the similarity based data dedupe performs 30% better in data reduction ratio than conventional dedupe techniques that look at identical blocks only. By comparing the hardware implementation with its software counterpart, the experimental results show that our preliminary FPGA implementation with maximum clock speed of 250MHz achieves at least 6 times improvement in latency over the software implementation running on state-of-art servers.
Keywords :
Cache; Data deduplication; Fingerprinting; PCIe; multiple lookup dictionary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Networking, Architecture and Storage (NAS), 2015 IEEE International Conference on
Conference_Location :
Boston, MA, USA
Type :
conf
DOI :
10.1109/NAS.2015.7255198
Filename :
7255198
Link To Document :
بازگشت