DocumentCode :
2729910
Title :
Method for feature word weight calculating
Author :
Li, Yanling ; Yuan, Jing ; Ye, Xia
Volume :
1
fYear :
2009
fDate :
20-22 Nov. 2009
Firstpage :
309
Lastpage :
312
Abstract :
Automatic text categorization has been one of the hotspots in the information processing field. To aim at the important impact of feature weight calculating on text classification accuracy, first, the relationship between text representation model and feature weight calculating is studied, and the existed methods of feature weight calculating are analyzed, then the common idea of feature weighting for vector space model (VSM) summarized. Second, the feature weighting thoughts for class space model (CSM) are gived, and on the basis of some existed methods, a number of feature weight calculating methods for CSM are proposed. Experimental results show that the proposed methods are effective in improving classification performance. After analyzing the experimental results, this paper points out that the appearance and the disappearance of feature words together to consider will lower the classification accuracy, so the favorable factors of a class should be highlighted, and the negative side should be ignored or minused.
Keywords :
classification; text analysis; automatic text categorization; class space model; feature word weight calculating; text classification accuracy; text representation model; vector space model; Automation; Educational institutions; Functional analysis; Information analysis; Information processing; Information retrieval; Performance analysis; Statistics; Testing; Text categorization; category weight; class space model (CSM); feature weight; text classification; vector space model (VSM);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Computing and Intelligent Systems, 2009. ICIS 2009. IEEE International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-4754-1
Electronic_ISBN :
978-1-4244-4738-1
Type :
conf
DOI :
10.1109/ICICISYS.2009.5357840
Filename :
5357840
Link To Document :
بازگشت