• DocumentCode
    82693
  • Title

    Efficient Prediction of Difficult Keyword Queries over Databases

  • Author

    Shiwen Cheng ; Termehchy, Arash ; Hristidis, Vagelis

  • Author_Institution
    Univ. of California, Riverside, Riverside, CA, USA
  • Volume
    26
  • Issue
    6
  • fYear
    2014
  • fDate
    Jun-14
  • Firstpage
    1507
  • Lastpage
    1520
  • Abstract
    Keyword queries on databases provide easy access to data, but often suffer from low ranking quality, i.e., low precision and/or recall, as shown in recent benchmarks. It would be useful to identify queries that are likely to have low ranking quality to improve the user satisfaction. For instance, the system may suggest to the user alternative queries for such hard queries. In this paper, we analyze the characteristics of hard queries and propose a novel framework to measure the degree of difficulty for a keyword query over a database, considering both the structure and the content of the database and the query results. We evaluate our query difficulty prediction model against two effectiveness benchmarks for popular keyword search ranking methods. Our empirical results show that our model predicts the hard queries with high accuracy. Further, we present a suite of optimizations to minimize the incurred time overhead.
  • Keywords
    query processing; alternative queries; database content; difficult keyword queries; keyword search ranking methods; query difficulty prediction model; query identification; user satisfaction; Benchmark testing; Databases; Motion pictures; Noise; Noise measurement; Probability distribution; Robustness; Query performance; Relational databases; Search process; databases; keyword query; query effectiveness; robustness;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2013.140
  • Filename
    6579590