DocumentCode
2430116
Title
Applying cascaded feature selection to SVM text categorization
Author
Masuyama, Takeshi ; Nakagawa, Hiroshi
Author_Institution
Inf. Technol. Center, Tokyo Univ., Japan
fYear
2002
fDate
2-6 Sept. 2002
Firstpage
241
Lastpage
245
Abstract
This paper investigates the effect of a cascaded feature selection (CFS) in SVM text categorization. Unlike existing feature selections, our method (CFS) has two advantages. One can make use of the characteristic of each feature (word). Another is that unnecessary test documents for a category, which should be categorized into a negative set, can be removed in the first step. Compared with the method which does not apply CFS, our method achieved significant good performance especially about the categories which contain a small number of training documents.
Keywords
data mining; feature extraction; learning (artificial intelligence); text analysis; SVM text categorization; cascaded feature selection; test documents; training documents; Humans; Information technology; Organizing; Quality management; Search engines; Support vector machine classification; Support vector machines; Testing; Text categorization; Web sites;
fLanguage
English
Publisher
ieee
Conference_Titel
Database and Expert Systems Applications, 2002. Proceedings. 13th International Workshop on
ISSN
1529-4188
Print_ISBN
0-7695-1668-8
Type
conf
DOI
10.1109/DEXA.2002.1045905
Filename
1045905
Link To Document