Title :
DBA: A Dynamic Bloom Filter Array for Scalable Membership Representation of Variable Large Data Sets
Author :
Wei, Jiansheng ; Jiang, Hong ; Zhou, Ke ; Feng, Dan
Author_Institution :
Wuhan Nat. Lab. for Optoelectron., Huazhong Univ. of Sci. & Technol., Wuhan, China
Abstract :
This paper proposes a Dynamic Bloom filter Array (DBA) to represent membership for variable large data sets in storage systems in a scalable way. DBA consists of dynamically created groups of space-efficient Bloom Filters (BFs) to accommodate changes in set sizes. In each group, BFs are homogeneous and the data layout is optimized at the bit level, so that they can be accessed in parallel to achieve high query performance. DBA can effectively control its query accuracy by partially adjusting the error rate of constructing BFs, where each BF corresponds to an independent subset of the data set to facilitate element location and membership confirmation. Further, DBA supports element deletion by introducing a lazy update policy. We prototype and evaluate our DBA scheme as a scalable fast index in the MAD2 deduplication storage system. Experimental results show that DBA (with 64 BFs per group) is capable of maintaining 90% of the peek query performance while scaling up to 160 BFs. DBA is also shown to excel in performance and space efficiency by theoretical analysis and other experiments based on real-world data sets.
Keywords :
data structures; information filters; query processing; storage management; DBA; MAD2 deduplication storage system; dynamic bloom filter array; query accuracy; query performance; scalable membership representation; space-efficient Bloom filter; subset; variable large data sets; Accuracy; Arrays; Error analysis; Filtering theory; Fingerprint recognition; Indexes; Random access memory; Bloom filter; Data management; fast index; membership representation;
Conference_Titel :
Modeling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS), 2011 IEEE 19th International Symposium on
Conference_Location :
Singapore
Print_ISBN :
978-1-4577-0468-0
DOI :
10.1109/MASCOTS.2011.36