Title :
A New Method to Compute the Word Relevance in News Corpus
Author :
Liu Jinpan ; He Liang ; Lin Xin ; Xu Mingmin ; Lu Wei
Author_Institution :
Dept. of Comput. Sci. & Technol., East China Normal Univ., Shanghai, China
Abstract :
In this paper we propose a new method to compute the relevance of term in news corpus. According to the characteristics of news corpus, we first propose that the news corpus should be divided into different channels, second we make use of the feature of news document, we divide the co-occurrence of terms into two cases, on the one hand the co-occurrence in the title of the news, On the other hand the co-occurrence in the news text, we use different methods to compute the co-occurrence. In the end, we introduce the web corpus Wikipedia to overcome some shortcomings of the news corpus.
Keywords :
natural language processing; text analysis; word processing; Wikipedia; news corpus; term cooccurrence; word combination; Computer applications; Computer science; Data mining; Helium; Information retrieval; Lighting; Ontologies; Rockets; TV; Wikipedia;
Conference_Titel :
Intelligent Systems and Applications (ISA), 2010 2nd International Workshop on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-5872-1
Electronic_ISBN :
978-1-4244-5874-5
DOI :
10.1109/IWISA.2010.5473239