DocumentCode :
1379146
Title :
False Negative Problem of Counting Bloom Filter
Author :
Guo, Deke ; Liu, Yunhao ; Li, XiangYang ; Yang, Panlong
Author_Institution :
Key Lab. of C4ISR Technol., Nat. Univ. of Defense Technol., Changsha, China
Volume :
22
Issue :
5
fYear :
2010
fDate :
5/1/2010 12:00:00 AM
Firstpage :
651
Lastpage :
664
Abstract :
Bloom filter is effective, space-efficient data structure for concisely representing a data set and supporting approximate membership queries. Traditionally, researchers often believe that it is possible that a Bloom filter returns a false positive, but it will never return a false negative under well-behaved operations. By investigating the mainstream variants, however, we observe that a Bloom filter does return false negatives in many scenarios. In this work, we show that the undetectable incorrect deletion of false positive items and detectable incorrect deletion of multiaddress items are two general causes of false negative in a Bloom filter. We then measure the potential and exposed false negatives theoretically and practically. Inspired by the fact that the potential false negatives are usually not fully exposed, we propose a novel Bloom filter scheme, which increases the ratio of bits set to a value larger than one without decreasing the ratio of bits set to zero. Mathematical analysis and comprehensive experiments show that this design can reduce the number of exposed false negatives as well as decrease the likelihood of false positives. To the best of our knowledge, this is the first work dealing with both the false positive and false negative problems of Bloom filter systematically when supporting standard usages of item insertion, query, and deletion operations.
Keywords :
data structures; information filtering; mathematical analysis; query processing; approximate membership queries; counting bloom filter scheme; data set representation; false negative problem; false positive items; mathematical analysis; multiaddress item deletion; space-efficient data structure; Bloom filter; false negative; multichoice counting Bloom filter.;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2009.209
Filename :
5374398
Link To Document :
بازگشت