DocumentCode :
2566550
Title :
Study on feature selection in finance text categorization
Author :
Sun, Changqiu ; Wang, Xiaolong ; Xu, Jun
Author_Institution :
Dept. of Comput. Sci. & Technol., Harbin Inst. of Technol., Shenzhen, China
fYear :
2009
fDate :
11-14 Oct. 2009
Firstpage :
5077
Lastpage :
5082
Abstract :
Document genre information is one of the most distinguishing features in information retrieval, which brings order to the search results. What the genre classification concerned is not the topic but the genre of document. In this paper, two different feature sets were employed: bag of words which are derived by feature selection method and structural features which are selected manually and subjectively. And a comparative study on feature selection in genre classification of Chinese finance text is presented. In empirical results with classifiers on the real world corpora, we find that those manual labeled features can improve the performance clearly.
Keywords :
classification; financial data processing; information retrieval; text analysis; document genre information; feature selection; finance text categorization; information retrieval; structural feature; Computer science; Cybernetics; Feature extraction; Finance; IEEE news; Information retrieval; Search engines; Sun; Text categorization; World Wide Web; Feature Selection; Genre Classification; Text Categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on
Conference_Location :
San Antonio, TX
ISSN :
1062-922X
Print_ISBN :
978-1-4244-2793-2
Electronic_ISBN :
1062-922X
Type :
conf
DOI :
10.1109/ICSMC.2009.5346030
Filename :
5346030
Link To Document :
بازگشت