Title :
A Machine Learning Approach to SPARQL Query Performance Prediction
Author :
Hasan, Ragib ; Gandon, Fabien
Author_Institution :
Wimmics, INRIA Sophia Antipolis, Sophia-Antipolis, France
Abstract :
In this paper we address the problem of predicting SPARQL query performance. We use machine learning techniques to learn SPARQL query performance from previously executed queries. Traditional approaches for estimating SPARQL query cost are based on statistics about the underlying data. However, in many use-cases involving querying Linked Data, statistics about the underlying data are often missing. Our approach does not require any statistics about the underlying RDF data, which makes it ideal for the Linked Data scenario. We show how to model SPARQL queries as feature vectors, and use k-nearest neighbors regression and Support Vector Machine with the nu-SVR kernel to accurately predict SPARQL query execution time.
Keywords :
learning (artificial intelligence); query languages; query processing; regression analysis; support vector machines; SPARQL query execution time; SPARQL query performance prediction; feature vector; k-nearest neighbor regression; linked data; machine learning; nu-SVR kernel; statistics; support vector machine; Feature extraction; Measurement; Resource description framework; Support vector machines; Training; Vectors; Linked Data; SPARQL;
Conference_Titel :
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on
Conference_Location :
Warsaw
DOI :
10.1109/WI-IAT.2014.43