Title :
WARP: Workload-aware replication and partitioning for RDF
Author :
Hose, K. ; Schenkel, Ralf
Author_Institution :
Dept. of Comput. Sci., Aalborg Univ., Aalborg, Denmark
Abstract :
With the increasing popularity of the Semantic Web, more and more data becomes available in RDF with SPARQL as a query language. Data sets, however, can become too big to be managed and queried on a single server in a scalable way. Existing distributed RDF stores approach this problem using data partitioning, aiming at limiting the communication between servers and exploiting parallelism. This paper proposes a distributed SPARQL engine that combines a graph partitioning technique with workload-aware replication of triples across partitions, enabling efficient query execution even for complex queries from the workload. Furthermore, it discusses query optimization techniques for producing efficient execution plans for ad-hoc queries not contained in the workload.
Keywords :
SQL; graph theory; pattern classification; query processing; replicated databases; semantic Web; ad hoc query; data partitioning; data server; data set management; distributed RDF; distributed SPARQL engine; graph partitioning technique; query execution; query language; query optimization technique; semantic Web; workload aware replication and partitioning; Distributed databases; Optimization; Parallel processing; Query processing; Resource description framework; Servers;
Conference_Titel :
Data Engineering Workshops (ICDEW), 2013 IEEE 29th International Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-5303-8
Electronic_ISBN :
978-1-4673-5302-1
DOI :
10.1109/ICDEW.2013.6547414