DocumentCode :
2477376
Title :
A New Method to Compute the Word Relevance in News Corpus
Author :
Liu Jinpan ; He Liang ; Lin Xin ; Xu Mingmin ; Lu Wei
Author_Institution :
Dept. of Comput. Sci. & Technol., East China Normal Univ., Shanghai, China
fYear :
2010
fDate :
22-23 May 2010
Firstpage :
1
Lastpage :
6
Abstract :
In this paper we propose a new method to compute the relevance of term in news corpus. According to the characteristics of news corpus, we first propose that the news corpus should be divided into different channels, second we make use of the feature of news document, we divide the co-occurrence of terms into two cases, on the one hand the co-occurrence in the title of the news, On the other hand the co-occurrence in the news text, we use different methods to compute the co-occurrence. In the end, we introduce the web corpus Wikipedia to overcome some shortcomings of the news corpus.
Keywords :
natural language processing; text analysis; word processing; Wikipedia; news corpus; term cooccurrence; word combination; Computer applications; Computer science; Data mining; Helium; Information retrieval; Lighting; Ontologies; Rockets; TV; Wikipedia;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems and Applications (ISA), 2010 2nd International Workshop on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-5872-1
Electronic_ISBN :
978-1-4244-5874-5
Type :
conf
DOI :
10.1109/IWISA.2010.5473239
Filename :
5473239
Link To Document :
بازگشت