DocumentCode
124161
Title
A Machine Learning Approach to SPARQL Query Performance Prediction
Author
Hasan, Ragib ; Gandon, Fabien
Author_Institution
Wimmics, INRIA Sophia Antipolis, Sophia-Antipolis, France
Volume
1
fYear
2014
fDate
11-14 Aug. 2014
Firstpage
266
Lastpage
273
Abstract
In this paper we address the problem of predicting SPARQL query performance. We use machine learning techniques to learn SPARQL query performance from previously executed queries. Traditional approaches for estimating SPARQL query cost are based on statistics about the underlying data. However, in many use-cases involving querying Linked Data, statistics about the underlying data are often missing. Our approach does not require any statistics about the underlying RDF data, which makes it ideal for the Linked Data scenario. We show how to model SPARQL queries as feature vectors, and use k-nearest neighbors regression and Support Vector Machine with the nu-SVR kernel to accurately predict SPARQL query execution time.
Keywords
learning (artificial intelligence); query languages; query processing; regression analysis; support vector machines; SPARQL query execution time; SPARQL query performance prediction; feature vector; k-nearest neighbor regression; linked data; machine learning; nu-SVR kernel; statistics; support vector machine; Feature extraction; Measurement; Resource description framework; Support vector machines; Training; Vectors; Linked Data; SPARQL;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on
Conference_Location
Warsaw
Type
conf
DOI
10.1109/WI-IAT.2014.43
Filename
6927552
Link To Document