DocumentCode :
3143780
Title :
Characteristic sets: Accurate cardinality estimation for RDF queries with multiple joins
Author :
Neumann, Thomas ; Moerkotte, Guido
Author_Institution :
Tech. Univ. München, Munich, Germany
fYear :
2011
fDate :
11-16 April 2011
Firstpage :
984
Lastpage :
994
Abstract :
Accurate cardinality estimates are essential for successful query optimization. This is true not only for relational DBMSs but also for RDF stores. An RDF database consists of a set of triples and, hence, can be seen as a relational database with a single table with three attributes. This makes RDF rather special in that queries typically contain many self joins. We show that relational DBMSs are not well-prepared to perform cardinality estimation in this context. Further, there are hardly any special cardinality estimation methods for RDF databases. To overcome this lack of appropriate cardinality estimation methods, we introduce characteristic sets together with new cardinality estimation methods based upon them. We then show experimentally that the new methods are, in the RDF context, highly superior to the estimation methods employed by commercial DBMSs and by the open-source RDF store RDF-3X.
Keywords :
data models; optimisation; public domain software; query processing; relational databases; RDF database; RDF query; RDF-3X; cardinality estimation; characteristic set; open source RDF; query optimization; relational DBMS; relational database; Accuracy; Books; Correlation; Estimation; Histograms; Resource description framework; Semantics;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Data Engineering (ICDE), 2011 IEEE 27th International Conference on
Conference_Location :
Hannover
ISSN :
1063-6382
Print_ISBN :
978-1-4244-8959-6
Electronic_ISBN :
1063-6382
Type :
conf
DOI :
10.1109/ICDE.2011.5767868
Filename :
5767868