DocumentCode
3101789
Title
An efficient feature selection method using named entity recognition for Chinese text categorization
Author
Liu, Bin ; Li, Chunping
Author_Institution
Sch. of Software, Tsinghua Univ., Beijing, China
Volume
6
fYear
2009
fDate
12-15 July 2009
Firstpage
3527
Lastpage
3531
Abstract
Feature selection is an important task for text categorization. Traditional feature selection methods are based on terms, but they may lose some useful information in texts. In this paper, we present a feature selection method that considers not only general terms but also named entities. To accompany this method, we propose a term weighting scheme for named entities. Experiments show that our method is effective compared with traditional methods.
Keywords
text analysis; Chinese text categorization; feature selection; named entities; named entity recognition; term weighting scheme; Cybernetics; Machine learning; Text categorization; Text recognition; term weighting
fLanguage
English
Publisher
IEEE
Conference_Titel
Machine Learning and Cybernetics, 2009 International Conference on
Conference_Location
Baoding
Print_ISBN
978-1-4244-3702-3
Electronic_ISBN
978-1-4244-3703-0
Type
conf
DOI
10.1109/ICMLC.2009.5212749
Filename
5212749