DocumentCode :
2463702
Title :
A Method of Text Feature Extraction Based on Weighted Scatter Difference
Author :
Haifeng, Liu ; Zhan, Su ; Zeqing, Yao ; Xueren, Zhang
Author_Institution :
Inst. of Sci., PLA Univ. of Sci. & Technol., Nanjing, China
Volume :
3
fYear :
2010
fDate :
16-17 Dec. 2010
Firstpage :
83
Lastpage :
86
Abstract :
Feature reduction is one of the core technologies of automatic text categorization. As for the scatter difference criterion, poor categorization effect is made when the between-class distance is small and the class density is high. In order to solve this problem, a weighted method based on the sample distribution is shown in the paper, which will make the between-class and within-class scatter matrixes with poor scatter be weighted, to enhance the categorization ability after dimensional reduction and to improve the dimensional reduction effect of linear feature extraction method based on scatter difference. The following experiment tells us that this method is superior to the original maximum scatter difference method in precision rate and recall rate.
Keywords :
feature extraction; information retrieval; text analysis; automatic text categorization; core technologies; feature reduction; text feature extraction method; weighted scatter difference; Covariance matrix; Feature extraction; Imaging; Support vector machine classification; Text categorization; Training; Vectors; feature extraction; feature reduction; scatter difference; text classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems (GCIS), 2010 Second WRI Global Congress on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-9247-3
Type :
conf
DOI :
10.1109/GCIS.2010.49
Filename :
5709328
Link To Document :
بازگشت