• DocumentCode
    480738
  • Title

    A Novel Language Model Based on Cognition Attention Attenuation in Web Retrieval

  • Author

    Cao, Donglin ; Xu, Hongbo ; Bai, Shuo ; Cheng, Xueqi ; Li, Shaozi

  • Author_Institution
    Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing
  • Volume
    1
  • fYear
    2008
  • fDate
    9-12 Dec. 2008
  • Firstpage
    676
  • Lastpage
    682
  • Abstract
    Language model is widely used in many retrieval systems. Its document representation is based on the bag of words assumption. Hence, each term in document is treated as an equal object and only the term frequency is considered as the evidence of the importance of term. In this paper, we study the problem of cognition attention attenuation in processing documents and present a cognition attention attenuation based language model. This model estimates the document model by attenuation process of term in document. Compared with the classical language model, the advantage of this model is considering about the document structure which is often used in text summarization. From the experiments results, our novel cognition attention attenuation based language model outperformed the classical language model with Dirichlet smoothing in blog page and Web page.
  • Keywords
    Internet; cognition; computational linguistics; document handling; information retrieval; Web retrieval; bag-of-word assumption; cognition attention attenuation; document processing; document representation; language model; Attenuation; Cognition; Cognitive science; Computers; Couplings; Frequency; Hidden Markov models; Intelligent agent; Natural languages; Smoothing methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
  • Conference_Location
    Sydney, NSW
  • Print_ISBN
    978-0-7695-3496-1
  • Type

    conf

  • DOI
    10.1109/WIIAT.2008.19
  • Filename
    4740529