Title of article :
Searching in Medline: Query expansion and manual indexing evaluation
Author/Authors :
Samir Abdou، نويسنده , , Jacques Savoy، نويسنده ,
Issue Information :
دوماهنامه با شماره پیاپی سال 2008
Abstract :
Based on a relatively large subset representing one third of the Medline collection, this paper evaluates ten different IR models, including recent developments in both probabilistic and language models. We show that the best performing IR models is a probabilistic model developed within the Divergence from Randomness framework [Amati, G., & van Rijsbergen, C.J. (2002) Probabilistic models of information retrieval based on measuring the divergence from randomness. ACM-Transactions on Information Systems 20(4), 357–389], which result in 170% enhancements in mean average precision when compared to the classical tf idf vector-space model. This paper also reports on our impact evaluations on the retrieval effectiveness of manually assigned descriptors (MeSH or Medical Subject Headings), showing that by including these terms retrieval performance can improve from 2.4% to 13.5%, depending on the underling IR model. Finally, we design a new general blind-query expansion approach showing improved retrieval performances compared to those obtained using the Rocchio approach.
Keywords :
Blind query expansion , Manual indexing , MEDLINE , MeSH , Probabilistic model , LANGUAGE MODEL , Genomics TREC , Rocchio query expansion , Evaluation
Journal title :
Information Processing and Management
Journal title :
Information Processing and Management