DocumentCode
3006558
Title
Efficient SPARQL Query Evaluation in a Database Cluster
Author
Fang Du ; Haoqiong Bian ; Yueguo Chen ; Xiaoyong Du
Author_Institution
DEKE Lab., Renmin Univ. of China, Beijing, China
fYear
2013
fDate
June 27 2013-July 2 2013
Firstpage
165
Lastpage
172
Abstract
Efficient SPARQL query evaluation is a significant challenge when the database contains billions of RDF triples, which is very common for many existing Web-scale RDF data sources. We address this challenge by 1) effectively partitioning the whole RDF dataset into small partitions according to the schemas of the RDF subjects, and 2) elaborately placing the partitions within clusters so that, on each local partition, we can make the most advantage of the state-of-the-art SPARQL query processing engine, and across the partitions, we can exploit the power of parallel databases for achieving scalable query evaluation of massive RDF data. This paper introduces the data partitioning and placement strategies, as well as the SPARQL query evaluation and optimization techniques in a cluster environment. Experiments are conducted over a synthesized dataset and a real dataset containing billions of triples. The results demonstrate that better query evaluation performance over the baseline can be achieved.
Keywords
query processing; relational databases; SPARQL query evaluation; SPARQL query optimization technique; Web-scale RDF data source; data partitioning strategy; data placement strategy; database cluster; query evaluation; resource description framework; Engines; Indexes; Optimization; Partitioning algorithms; Query processing; Resource description framework; RDF; SPARQL query; data partitioning; parallel database;
fLanguage
English
Publisher
ieee
Conference_Titel
Big Data (BigData Congress), 2013 IEEE International Congress on
Conference_Location
Santa Clara, CA
Print_ISBN
978-0-7695-5006-0
Type
conf
DOI
10.1109/BigData.Congress.2013.30
Filename
6597133
Link To Document