DocumentCode :
2036173
Title :
Research into the topic´s representation in topic tracking
Author :
Zhao, Hua ; Wang, FengLing
Author_Institution :
Coll. of Inf. Sci. & Eng., Shandong Univ. of Sci. & Technol., Qingdao, China
Volume :
6
fYear :
2010
fDate :
10-12 Aug. 2010
Firstpage :
2498
Lastpage :
2502
Abstract :
This paper uses Vector Space Model to represent topic, and focuses on the creation of the model. Based on the analysis of the characteristics of the English news stories, the paper proposed two methods to improve the topic´s representation. Firstly, we propose a news story-oriented feature extraction algorithm based on the combination of word analysis and the location characteristic of the news stories. The basic idea of word analysis is to divide the words into capital words and common words. The location characteristic decides the importance of the words based on the inverse-pyramidal structure of the news stories. Secondly, we present a new method to compute the feature´s weight based on the fusion of several feature extraction methods. This method gives the feature bigger weight, which is selected by more feature extraction algorithms. Experimental results indicate these two proposed methods perform well.
Keywords :
feature extraction; information resources; text analysis; vectors; word processing; English news stories; capital words; common words; feature weight; fusion; inverse-pyramidal structure; location characteristic; news story-oriented feature extraction; topic representation; topic tracking; vector space model; word analysis; Algorithm design and analysis; Feature extraction; Hidden Markov models; Information retrieval; Target tracking; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
Conference_Location :
Yantai, Shandong
Print_ISBN :
978-1-4244-5931-5
Type :
conf
DOI :
10.1109/FSKD.2010.5569601
Filename :
5569601
Link To Document :
بازگشت