DocumentCode
124134
Title
Efficient Spatio-textual Similarity Join Using MapReduce
Author
Yu Zhang ; Youzhong Ma ; Xiaofeng Meng
Author_Institution
Sch. of Inf., Renmin Univ. of China, Beijing, China
Volume
1
fYear
2014
fDate
11-14 Aug. 2014
Firstpage
52
Lastpage
59
Abstract
Spatio-textual similarity join is a basic and significant operation in many applications. It is an operation that finds all the similar pairs of objects which have similar textual descriptions and are spatially close to each other. With the popularity of GPS and their applications, the size of spatiotextual data is increasing explosively, while the existing methods cannot deal with the spatio-textual similarity join efficiently on massive data. In this paper, we propose several approaches for spatio-textual similarity join using MapReduce. We use the prefix filtering and grid partitioning techniques to filter the spatiotextual objects under the filter-and-refine framework. Besides, we propose two kinds of optimization methods to improve the efficiency of the basic spatio-textual similarity join method. In the end, we conduct extensive experiments using several synthetic datasets that are comprised of real datasets, and the results show that our approaches have good performance in both efficiency and scalability.
Keywords
Global Positioning System; information filtering; mobile computing; parallel processing; text analysis; GPS; MapReduce; filter-and-refine framework; grid partitioning techniques; prefix filtering techniques; spatio-textual data; spatio-textual objects; spatio-textual similarity join; Algorithm design and analysis; Global Positioning System; Heuristic algorithms; Partitioning algorithms; Spatial indexes; Twitter; MapReduce; cell prefixes; kdtree; spatio-textual similarity join;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on
Conference_Location
Warsaw
Type
conf
DOI
10.1109/WI-IAT.2014.16
Filename
6927525
Link To Document