• DocumentCode
    124161
  • Title

    A Machine Learning Approach to SPARQL Query Performance Prediction

  • Author

    Hasan, Ragib ; Gandon, Fabien

  • Author_Institution
    Wimmics, INRIA Sophia Antipolis, Sophia-Antipolis, France
  • Volume
    1
  • fYear
    2014
  • fDate
    11-14 Aug. 2014
  • Firstpage
    266
  • Lastpage
    273
  • Abstract
    In this paper we address the problem of predicting SPARQL query performance. We use machine learning techniques to learn SPARQL query performance from previously executed queries. Traditional approaches for estimating SPARQL query cost are based on statistics about the underlying data. However, in many use-cases involving querying Linked Data, statistics about the underlying data are often missing. Our approach does not require any statistics about the underlying RDF data, which makes it ideal for the Linked Data scenario. We show how to model SPARQL queries as feature vectors, and use k-nearest neighbors regression and Support Vector Machine with the nu-SVR kernel to accurately predict SPARQL query execution time.
  • Keywords
    learning (artificial intelligence); query languages; query processing; regression analysis; support vector machines; SPARQL query execution time; SPARQL query performance prediction; feature vector; k-nearest neighbor regression; linked data; machine learning; nu-SVR kernel; statistics; support vector machine; Feature extraction; Measurement; Resource description framework; Support vector machines; Training; Vectors; Linked Data; SPARQL;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on
  • Conference_Location
    Warsaw
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2014.43
  • Filename
    6927552