Title :
Scalable collective spatial keyword query
Author :
Peijun He ; Hao Xu ; Xiang Zhao ; Zhitao Shen
Author_Institution :
Coll. of Inf. Syst. & Manage., Nat. Univ. of Defense Technol., Changsha, China
Abstract :
Spatial keyword queries have been widely studied recently, along with the emergence of large amount of geo-textual data. We consider the problem of scalable collective spatial keyword queries in this paper. Such query has a wide spectrum of applications; for instance, to find the best (nearest) area to organize a friend get-together where bars, restaurants and accommodations are nearby, and compose a group of members from different professional domains, e.g., computing, accounting, etc, for a specific task, etc. While existing algorithms processes the queries well, we observe their shortcomings in handling large-scale datasets. To this end, we propose a distributed solution following Spark programming paradigm. Moreover, a grid-based optimization technique is further proposed to enhance the efficiency. Extensive experiments on various datasets confirm that the proposed algorithm efficiently solves the problem at scale.
Keywords :
distributed programming; query processing; Spark programming paradigm; geo-textual data; grid-based optimization technique; large-scale dataset handling; scalable collective spatial keyword query; Cost function; Distributed databases; Indexes; Partitioning algorithms; Programming; Query processing; Sparks;
Conference_Titel :
Data Engineering Workshops (ICDEW), 2015 31st IEEE International Conference on
Conference_Location :
Seoul
DOI :
10.1109/ICDEW.2015.7129574