• DocumentCode
    644012
  • Title

    LDA based PSEUDO relevance feedback for cross language information retrieval

  • Author

    Xuwen Wang ; Qiang Zhang ; Xiaojie Wang ; Yueping Sun

  • Author_Institution
    Beijing Univ. of Posts & Telecommun., Beijing, China
  • Volume
    03
  • fYear
    2012
  • fDate
    Oct. 30 2012-Nov. 1 2012
  • Firstpage
    1511
  • Lastpage
    1516
  • Abstract
    This paper introduced a LDA-based pseudo relevance feedback (PRF) model for cross language information retrieval. To validate the performance of PRF techniques in CLIR task, we conducted cross language query expansion experiments based on a self-constructed CLIR system, the LDA-based PRF model was applied before or after the query translating process, namely the pre-translation-PRF, the post-translation-PRF, and the combined-PRF strategy. We also compared this model with the classical VSM-based PRF algorithm. Experiment results showed that the proposed LDA-based PRF method was effective for improving the performance of CLIR.
  • Keywords
    natural language processing; query processing; relevance feedback; vectors; CLIR task; LDA based pseudo relevance feedback; PRF technique performance; combined-PRF strategy; cross language information retrieval; cross language query expansion experiments; post-translation-PRF; pretranslation-PRF; query translating process; self-constructed CLIR system; Approximation methods; Data models; Google; Information retrieval; Research and development; Resource management; Vectors; PSEUDO relevance feed back; cross language information retrieval; latent dirichlet allocation (LDA); query expansion; vector space model (VSM);
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud Computing and Intelligent Systems (CCIS), 2012 IEEE 2nd International Conference on
  • Conference_Location
    Hangzhou
  • Print_ISBN
    978-1-4673-1855-6
  • Type

    conf

  • DOI
    10.1109/CCIS.2012.6664637
  • Filename
    6664637