• DocumentCode
    3756287
  • Title

    BF-MapReduce: A Bloom Filter Based Efficient Lightweight Search

  • Author

    Zi-long Tan;Ke-ren Zhou;Hao Zhang;Wei Zhou

  • Author_Institution
    Sch. of Software, Yunnan Univ., Kunming, China
  • fYear
    2015
  • Firstpage
    125
  • Lastpage
    129
  • Abstract
    MapReduce is an attractive programming model for large-scale data-parallel applications. However, the original MapReduce framework also needs some optimizations to improve its performance. In this paper, we propose a novel bloom filter based lightweight MapReduce index (BF-MapReduce). Instead of scanning the whole dataset, our approach uses an auxiliary index to quickly skip unnecessary data segments, which can efficiently degrade the processing cost at map phase. Moreover, in order to deal with multi-dimension dataset, a converting schema is proposed. It can map multi-dimension data into one-dimension index. The experimental results show that our approach is efficient and lightweight. It can reduce the task running time dramatically with a little storage and maintenance cost.
  • Keywords
    "Indexes","Optimization","Maintenance engineering","File systems","Internet","Information filters"
  • Publisher
    ieee
  • Conference_Titel
    Collaboration and Internet Computing (CIC), 2015 IEEE Conference on
  • Type

    conf

  • DOI
    10.1109/CIC.2015.47
  • Filename
    7423074