DocumentCode
2435192
Title
Heuristics to locate the best document set in information retrieval systems
Author
Lucarella, D.
Author_Institution
Dipartimento di Sci. dell´´Inf., Univ. degli Studi di Milano, Italy
fYear
1989
fDate
22-24 March 1989
Firstpage
567
Lastpage
571
Abstract
The use of best-match search strategies in information retrieval systems is discussed. In response to a given query, best-match searching requires the identification of those documents in the collection which are most similar to the query, with similarity being measured by an appropriate closeness function. The emphasis is on heuristics to efficiently locate the closest documents set. The problem is introduced with reference to a straightforward search procedure that returns the best documents manipulating inverted index entries. An improved algorithm is presented which computes in advance an upper bound on closeness, avoiding the exact computation of closeness in many instances and thus optimizing both the number of documents to be evaluated and the number of inverted lists to be inspected. The algorithm is analyzed, and experimental results are reported.<>
Keywords
information retrieval systems; best-match search strategies; closeness function; closest documents set; heuristics; information retrieval systems; inverted index; inverted lists; upper bound; Algorithm design and analysis; Information retrieval; Optical computing;
fLanguage
English
Publisher
ieee
Conference_Titel
Computers and Communications, 1989. Conference Proceedings., Eighth Annual International Phoenix Conference on
Conference_Location
Scottsdale, AZ, USA
Print_ISBN
0-8186-1918-x
Type
conf
DOI
10.1109/PCCC.1989.37447
Filename
37447
Link To Document