• DocumentCode
    2330667
  • Title
    Topic based language models for ad hoc information retrieval
  • Author
    Azzopardi, L.; Girolami, M.; van Rijsbergen, C.J.
  • Author_Institution
    Sch. of ICT, Paisley Univ., UK
  • Volume
    4
  • fYear
    2004
  • fDate
    25-29 July 2004
  • Firstpage
    3281
  • Abstract
    We propose a topic based approach to language modelling for ad-hoc information retrieval (IR). Many smoothed estimators used for the multinomial query model in IR rely upon the estimated background collection probabilities. In this paper, we propose a topic based language modelling approach that uses a more informative prior based on the topical content of a document. In our experiments, the proposed model provides comparable IR performance to the standard models, but when combined in a two stage language model, it outperforms all other estimated models.
  • Keywords
    information retrieval; probability; ad hoc information retrieval; background collection probability; language model; multinomial query model; Hidden Markov models; Information retrieval; Mathematical model; Optical computing; Smoothing methods; Vocabulary
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
  • Conference_Location
    Budapest
  • ISSN
    1098-7576
  • Print_ISBN
    0-7803-8359-1
  • Type
    conf
  • DOI
    10.1109/IJCNN.2004.1381205
  • Filename
    1381205