DocumentCode
2463702
Title
A Method of Text Feature Extraction Based on Weighted Scatter Difference
Author
Haifeng, Liu ; Zhan, Su ; Zeqing, Yao ; Xueren, Zhang
Author_Institution
Inst. of Sci., PLA Univ. of Sci. & Technol., Nanjing, China
Volume
3
fYear
2010
fDate
16-17 Dec. 2010
Firstpage
83
Lastpage
86
Abstract
Feature reduction is one of the core technologies of automatic text categorization. As for the scatter difference criterion, poor categorization effect is made when the between-class distance is small and the class density is high. In order to solve this problem, a weighted method based on the sample distribution is shown in the paper, which will make the between-class and within-class scatter matrixes with poor scatter be weighted, to enhance the categorization ability after dimensional reduction and to improve the dimensional reduction effect of linear feature extraction method based on scatter difference. The following experiment tells us that this method is superior to the original maximum scatter difference method in precision rate and recall rate.
Keywords
feature extraction; information retrieval; text analysis; automatic text categorization; core technologies; feature reduction; text feature extraction method; weighted scatter difference; Covariance matrix; Feature extraction; Imaging; Support vector machine classification; Text categorization; Training; Vectors; feature extraction; feature reduction; scatter difference; text classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems (GCIS), 2010 Second WRI Global Congress on
Conference_Location
Wuhan
Print_ISBN
978-1-4244-9247-3
Type
conf
DOI
10.1109/GCIS.2010.49
Filename
5709328
Link To Document