DocumentCode
480738
Title
A Novel Language Model Based on Cognition Attention Attenuation in Web Retrieval
Author
Cao, Donglin ; Xu, Hongbo ; Bai, Shuo ; Cheng, Xueqi ; Li, Shaozi
Author_Institution
Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing
Volume
1
fYear
2008
fDate
9-12 Dec. 2008
Firstpage
676
Lastpage
682
Abstract
Language model is widely used in many retrieval systems. Its document representation is based on the bag of words assumption. Hence, each term in document is treated as an equal object and only the term frequency is considered as the evidence of the importance of term. In this paper, we study the problem of cognition attention attenuation in processing documents and present a cognition attention attenuation based language model. This model estimates the document model by attenuation process of term in document. Compared with the classical language model, the advantage of this model is considering about the document structure which is often used in text summarization. From the experiments results, our novel cognition attention attenuation based language model outperformed the classical language model with Dirichlet smoothing in blog page and Web page.
Keywords
Internet; cognition; computational linguistics; document handling; information retrieval; Web retrieval; bag-of-word assumption; cognition attention attenuation; document processing; document representation; language model; Attenuation; Cognition; Cognitive science; Computers; Couplings; Frequency; Hidden Markov models; Intelligent agent; Natural languages; Smoothing methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
Conference_Location
Sydney, NSW
Print_ISBN
978-0-7695-3496-1
Type
conf
DOI
10.1109/WIIAT.2008.19
Filename
4740529
Link To Document