Title :
Efficient SPARQL Query Evaluation in a Database Cluster
Author :
Fang Du ; Haoqiong Bian ; Yueguo Chen ; Xiaoyong Du
Author_Institution :
DEKE Lab., Renmin Univ. of China, Beijing, China
fDate :
June 27 2013-July 2 2013
Abstract :
Efficient SPARQL query evaluation is a significant challenge when the database contains billions of RDF triples, which is very common for many existing Web-scale RDF data sources. We address this challenge by 1) effectively partitioning the whole RDF dataset into small partitions according to the schemas of the RDF subjects, and 2) elaborately placing the partitions within clusters so that, on each local partition, we can make the most advantage of the state-of-the-art SPARQL query processing engine, and across the partitions, we can exploit the power of parallel databases for achieving scalable query evaluation of massive RDF data. This paper introduces the data partitioning and placement strategies, as well as the SPARQL query evaluation and optimization techniques in a cluster environment. Experiments are conducted over a synthesized dataset and a real dataset containing billions of triples. The results demonstrate that better query evaluation performance over the baseline can be achieved.
Keywords :
query processing; relational databases; SPARQL query evaluation; SPARQL query optimization technique; Web-scale RDF data source; data partitioning strategy; data placement strategy; database cluster; query evaluation; resource description framework; Engines; Indexes; Optimization; Partitioning algorithms; Query processing; Resource description framework; RDF; SPARQL query; data partitioning; parallel database;
Conference_Titel :
Big Data (BigData Congress), 2013 IEEE International Congress on
Conference_Location :
Santa Clara, CA
Print_ISBN :
978-0-7695-5006-0
DOI :
10.1109/BigData.Congress.2013.30