DocumentCode :
710124
Title :
Executing queries over schemaless RDF databases
Author :
Aluc, Gunes ; Ozsu, M. Tamer ; Daudjee, Khuzaima ; Hartig, Olaf
Author_Institution :
Cheriton Sch. of Comput. Sci., Univ. of Waterloo, Waterloo, ON, Canada
fYear :
2015
fDate :
13-17 April 2015
Firstpage :
807
Lastpage :
818
Abstract :
Recent advances in Linked Data Management and the Semantic Web have led to a rapid increase in both the quantity as well as the variety of Web applications that rely on the SPARQL interface to query RDF data. Thus, RDF data management systems are increasingly exposed to workloads that are far more diverse and dynamic than what these systems were designed to handle. The problem is that existing systems rely on a workload-oblivious physical representation that has a fixed schema, which is not suitable for diverse and dynamic workloads. To address these issues, we propose a physical representation that is schemaless. The resulting flexibility enables an RDF dataset to be clustered based purely on the workload, which is key to achieving good performance through optimized I/O and cache utilization. Consequently, given a workload, we develop techniques to compute a good clustering of the database. We also design a new query evaluation model, namely, schemaless-evaluation that leverages this workload-aware clustering of the database whereby, with high probability, each tuple in the result set of a query is expected to be contained in at most one cluster. Our query evaluation model exploits this property to achieve better performance while ensuring fast generation of query plans without being hindered by the lack of a fixed physical schema.
Keywords :
cache storage; database management systems; pattern clustering; query processing; semantic Web; RDF data management systems; RDF dataset; SPARQL interface; Web applications; cache utilization; dynamic workloads; linked data management; query evaluation model; query execution; schemaless RDF databases; semantic Web; workload-aware clustering; workload-oblivious physical representation; Clustering algorithms; Computational modeling; Indexes; Query processing; Resource description framework; Standards;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2015 IEEE 31st International Conference on
Conference_Location :
Seoul
Type :
conf
DOI :
10.1109/ICDE.2015.7113335
Filename :
7113335
Link To Document :
بازگشت