DocumentCode
2477376
Title
A New Method to Compute the Word Relevance in News Corpus
Author
Liu Jinpan ; He Liang ; Lin Xin ; Xu Mingmin ; Lu Wei
Author_Institution
Dept. of Comput. Sci. & Technol., East China Normal Univ., Shanghai, China
fYear
2010
fDate
22-23 May 2010
Firstpage
1
Lastpage
6
Abstract
In this paper we propose a new method to compute the relevance of term in news corpus. According to the characteristics of news corpus, we first propose that the news corpus should be divided into different channels, second we make use of the feature of news document, we divide the co-occurrence of terms into two cases, on the one hand the co-occurrence in the title of the news, On the other hand the co-occurrence in the news text, we use different methods to compute the co-occurrence. In the end, we introduce the web corpus Wikipedia to overcome some shortcomings of the news corpus.
Keywords
natural language processing; text analysis; word processing; Wikipedia; news corpus; term cooccurrence; word combination; Computer applications; Computer science; Data mining; Helium; Information retrieval; Lighting; Ontologies; Rockets; TV; Wikipedia;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems and Applications (ISA), 2010 2nd International Workshop on
Conference_Location
Wuhan
Print_ISBN
978-1-4244-5872-1
Electronic_ISBN
978-1-4244-5874-5
Type
conf
DOI
10.1109/IWISA.2010.5473239
Filename
5473239
Link To Document