DocumentCode :
2425430
Title :
Combination Methodologies of Text Classifier: Design and Implementation
Author :
Bai Rujiang ; Wang Xiaoyue
Author_Institution :
Shandong Univ. of Technol. Libr., Zibo
Volume :
4
fYear :
2007
fDate :
24-27 Aug. 2007
Firstpage :
21
Lastpage :
25
Abstract :
Support vector machines (SVMs), one of the most popular techniques for classification, have been widely used in many application areas. The kernel parameter settings used when training an SVM affect classification accuracy. Feature selection is another factor that affects classification accuracy. The objective of this work is to reduce the dimension of the feature vectors and optimize the parameters in order to improve SVM classification accuracy and speed. We present a rough set method for feature reduction and a genetic algorithm approach for feature selection and parameter optimization to solve this problem. We evaluated the proposed method on Reuters 21578. Experimental results indicate that, compared with traditional methods, the proposed method significantly improves classification accuracy and requires fewer input features for support vector machines.
Keywords :
classification; genetic algorithms; rough set theory; support vector machines; text analysis; Reuters 21578; classification accuracy; feature selection; feature vectors; genetic algorithm; rough set method; support vector machines; text classifier; Genetic algorithms; Instruments; Kernel; Libraries; Optimization methods; Organizing; Rough sets; Support vector machine classification; Support vector machines; Text categorization;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2874-8
Type :
conf
DOI :
10.1109/FSKD.2007.222
Filename :
4406346