• DocumentCode
    2435192
  • Title

    Heuristics to locate the best document set in information retrieval systems

  • Author

    Lucarella, D.

  • Author_Institution
    Dipartimento di Sci. dell´´Inf., Univ. degli Studi di Milano, Italy
  • fYear
    1989
  • fDate
    22-24 March 1989
  • Firstpage
    567
  • Lastpage
    571
  • Abstract
    The use of best-match search strategies in information retrieval systems is discussed. In response to a given query, best-match searching requires the identification of those documents in the collection which are most similar to the query, with similarity being measured by an appropriate closeness function. The emphasis is on heuristics to efficiently locate the closest documents set. The problem is introduced with reference to a straightforward search procedure that returns the best documents manipulating inverted index entries. An improved algorithm is presented which computes in advance an upper bound on closeness, avoiding the exact computation of closeness in many instances and thus optimizing both the number of documents to be evaluated and the number of inverted lists to be inspected. The algorithm is analyzed, and experimental results are reported.<>
  • Keywords
    information retrieval systems; best-match search strategies; closeness function; closest documents set; heuristics; information retrieval systems; inverted index; inverted lists; upper bound; Algorithm design and analysis; Information retrieval; Optical computing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computers and Communications, 1989. Conference Proceedings., Eighth Annual International Phoenix Conference on
  • Conference_Location
    Scottsdale, AZ, USA
  • Print_ISBN
    0-8186-1918-x
  • Type

    conf

  • DOI
    10.1109/PCCC.1989.37447
  • Filename
    37447