DocumentCode
2425430
Title
Combination Methodologies of Text Classifier: Design and Implementation
Author
Bai Rujiang ; Wang Xiaoyue
Author_Institution
Shandong Univ. of Technol. Libr., Zibo
Volume
4
fYear
2007
fDate
24-27 Aug. 2007
Firstpage
21
Lastpage
25
Abstract
Support vector machines, one of the most population techniques for classification, have been widely used in many application areas. The kernel parameters setting for SVM in a training process impacts on the classification accuracy. Feature selection is another factor that impacts classification accuracy .The objective of this work is to reduce the dimension of feature vectors, optimizing the parameters to improve the SVM classification accuracy and speed. We present rough set method for feature reduce and a genetic algorithm approach for feature selection and parameters optimization to solve this kind of problem. We tried Reuters 21578 using the proposed method. Experimental results indicate, compared with the traditional methods, our proposed method significantly improves the classification accuracy and has fewer input features for support vector machines.
Keywords
classification; genetic algorithms; rough set theory; support vector machines; text analysis; Reuters 21578; classification accuracy; feature selection; feature vectors; genetic algorithm; rough set method; support vector machines; text classifier; Genetic algorithms; Instruments; Kernel; Libraries; Optimization methods; Organizing; Rough sets; Support vector machine classification; Support vector machines; Text categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location
Haikou
Print_ISBN
978-0-7695-2874-8
Type
conf
DOI
10.1109/FSKD.2007.222
Filename
4406346
Link To Document