• DocumentCode
    124134
  • Title

    Efficient Spatio-textual Similarity Join Using MapReduce

  • Author

    Yu Zhang ; Youzhong Ma ; Xiaofeng Meng

  • Author_Institution
    Sch. of Inf., Renmin Univ. of China, Beijing, China
  • Volume
    1
  • fYear
    2014
  • fDate
    11-14 Aug. 2014
  • Firstpage
    52
  • Lastpage
    59
  • Abstract
    Spatio-textual similarity join is a basic and significant operation in many applications. It is an operation that finds all the similar pairs of objects which have similar textual descriptions and are spatially close to each other. With the popularity of GPS and their applications, the size of spatiotextual data is increasing explosively, while the existing methods cannot deal with the spatio-textual similarity join efficiently on massive data. In this paper, we propose several approaches for spatio-textual similarity join using MapReduce. We use the prefix filtering and grid partitioning techniques to filter the spatiotextual objects under the filter-and-refine framework. Besides, we propose two kinds of optimization methods to improve the efficiency of the basic spatio-textual similarity join method. In the end, we conduct extensive experiments using several synthetic datasets that are comprised of real datasets, and the results show that our approaches have good performance in both efficiency and scalability.
  • Keywords
    Global Positioning System; information filtering; mobile computing; parallel processing; text analysis; GPS; MapReduce; filter-and-refine framework; grid partitioning techniques; prefix filtering techniques; spatio-textual data; spatio-textual objects; spatio-textual similarity join; Algorithm design and analysis; Global Positioning System; Heuristic algorithms; Partitioning algorithms; Spatial indexes; Twitter; MapReduce; cell prefixes; kdtree; spatio-textual similarity join;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on
  • Conference_Location
    Warsaw
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2014.16
  • Filename
    6927525