DocumentCode :
3284571
Title :
Feature Selection Method of Text Tendency Classification
Author :
Li, Yanling ; Dai, Guanzhong ; Li, Gang
Author_Institution :
Northwestern Poly Tech. Univ., Xi´´an
Volume :
2
fYear :
2008
fDate :
18-20 Oct. 2008
Firstpage :
34
Lastpage :
37
Abstract :
Recently, automatic text categorization has made rapid progress and been one of the hotspots in the information processing field. Text tendency classification is one type of text categorization, which has very important applications in information retrievals bad information identification and filtering , content security management and analysis of public opinion tendency. To aim at the important influence of feature selection on text classification accuracy, this paper mainly studied feature selection method of tendency classification. First, to analyze and summarize the current variety methods, it points out three common ideas of feature selection. Then based on the analysis of complexity of tendency classification, it is proved that feature selection method based on the features´ distribution in text categories is more suitable for tendency classification than the method based on the correlativity of features and categories. Finally, it gives test results for balanced training sets and unbalanced training sets.
Keywords :
classification; text analysis; content security management; feature distribution; feature selection method; information identification; information processing; information retrieval; public opinion tendency; text categorization; text tendency classification; Content based retrieval; Content management; Information analysis; Information filtering; Information filters; Information processing; Information retrieval; Information security; Testing; Text categorization; MI; distribution coefficient; feature selection; imbalanced datasets; text classification; text tendency classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
Conference_Location :
Shandong
Print_ISBN :
978-0-7695-3305-6
Type :
conf
DOI :
10.1109/FSKD.2008.263
Filename :
4666075
Link To Document :
بازگشت