• DocumentCode
    2082434
  • Title

    Ranked queries over sources with Boolean query interfaces without ranking support

  • Author

    Hristidis, Vagelis ; Hu, Yuheng ; Ipeirotis, Panagiotis G.

  • Author_Institution
    Sch. of Comput. & Inf. Sci., Florida Int. Univ., Miami, FL, USA
  • fYear
    2010
  • fDate
    1-6 March 2010
  • Firstpage
    872
  • Lastpage
    875
  • Abstract
    Many online or local data sources provide powerful querying mechanisms but limited ranking capabilities. For instance, PubMed allows users to submit highly expressive Boolean keyword queries, but ranks the query results by date only. However, a user would typically prefer a ranking by relevance, measured by an Information Retrieval (IR) ranking function. The naive approach would be to submit a disjunctive query with all query keywords, retrieve the returned documents, and then re-rank them. Unfortunately, such an operation would be very expensive due to the large number of results returned by disjunctive queries. In this paper we present algorithms that return the top results for a query, ranked according to an IR-style ranking function, while operating on top of a source with a Boolean query interface with no ranking capabilities (or a ranking capability of no interest to the end user). The algorithms generate a series of conjunctive queries that return only documents that are candidates for being highly ranked according to a relevance metric. Our approach can also be applied to other settings where the ranking is monotonic on a set of factors (query keywords in IR) and the source query interface is a Boolean expression of these factors. Our comprehensive experimental evaluation on the PubMed database and TREC dataset show that we achieve order of magnitude improvement compared to the current baseline approaches.
  • Keywords
    query processing; relevance feedback; user interfaces; Boolean query interfaces; PubMed algorithm; conjunctive queries; disjunctive query; information retrieval ranking function; query keywords; query ranking; querying mechanisms; ranking support; relevance metric; relevance ranking; Biomedical computing; Biomedical measurements; Computer interfaces; Databases; Immune system; Information management; Information retrieval; Optical computing; Trademarks; Web services;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2010 IEEE 26th International Conference on
  • Conference_Location
    Long Beach, CA
  • Print_ISBN
    978-1-4244-5445-7
  • Electronic_ISBN
    978-1-4244-5444-0
  • Type

    conf

  • DOI
    10.1109/ICDE.2010.5447918
  • Filename
    5447918