DocumentCode
3756287
Title
BF-MapReduce: A Bloom Filter Based Efficient Lightweight Search
Author
Zi-long Tan;Ke-ren Zhou;Hao Zhang;Wei Zhou
Author_Institution
Sch. of Software, Yunnan Univ., Kunming, China
fYear
2015
Firstpage
125
Lastpage
129
Abstract
MapReduce is an attractive programming model for large-scale data-parallel applications. However, the original MapReduce framework still needs optimization to improve its performance. In this paper, we propose a novel Bloom filter based lightweight MapReduce index (BF-MapReduce). Instead of scanning the whole dataset, our approach uses an auxiliary index to quickly skip unnecessary data segments, which reduces the processing cost of the map phase. Moreover, to handle multi-dimensional datasets, we propose a conversion scheme that maps multi-dimensional data to a one-dimensional index. The experimental results show that our approach is efficient and lightweight: it dramatically reduces task running time at little storage and maintenance cost.
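The idea in the abstract, one Bloom filter per data segment consulted before the map phase scans it, can be illustrated with a minimal Python sketch. This is not the authors' implementation. The names `BloomFilter`, `to_index_key`, `build_index`, and `search`, the filter sizes, and the concatenation-based conversion of multi-dimensional records into a one-dimensional key are all assumptions made for illustration; the abstract does not specify the paper's actual conversion scheme.

```python
import hashlib


class BloomFilter:
    """Small Bloom filter: no false negatives, occasional false positives."""

    def __init__(self, num_bits=1024, num_hashes=3):
        # Sizes are illustrative assumptions, not values from the paper.
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, key):
        # Derive num_hashes bit positions by salting SHA-256 with the hash index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{key}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key):
        return all(
            self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(key)
        )


def to_index_key(record, dimensions):
    # Hypothetical conversion: join the chosen dimensions into one string key.
    return "|".join(f"{d}={record[d]}" for d in dimensions)


def build_index(segments, dimensions):
    # One Bloom filter per data segment acts as the auxiliary index.
    index = []
    for segment in segments:
        bf = BloomFilter()
        for record in segment:
            bf.add(to_index_key(record, dimensions))
        index.append(bf)
    return index


def search(segments, index, query, dimensions):
    # Scan only segments whose filter reports a possible match.
    key = to_index_key(query, dimensions)
    hits, scanned = [], 0
    for segment, bf in zip(segments, index):
        if not bf.might_contain(key):
            continue  # segment definitely lacks the key, so skip it
        scanned += 1
        hits.extend(r for r in segment if to_index_key(r, dimensions) == key)
    return hits, scanned
```

Because a Bloom filter can report false positives but never false negatives, a skipped segment is guaranteed not to hold the key, while a scanned segment still has to be checked record by record. In a real MapReduce job the filter lookup would presumably happen before a split is handed to a mapper; here `search` stands in for that step.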
Keywords
"Indexes","Optimization","Maintenance engineering","File systems","Internet","Information filters"
Publisher
ieee
Conference_Titel
Collaboration and Internet Computing (CIC), 2015 IEEE Conference on
Type
conf
DOI
10.1109/CIC.2015.47
Filename
7423074