Title :
Combination Methodologies of Text Classifier: Design and Implementation
Author :
Bai Rujiang ; Wang Xiaoyue
Author_Institution :
Shandong Univ. of Technol. Libr., Zibo
Abstract :
Support vector machines, one of the most population techniques for classification, have been widely used in many application areas. The kernel parameters setting for SVM in a training process impacts on the classification accuracy. Feature selection is another factor that impacts classification accuracy .The objective of this work is to reduce the dimension of feature vectors, optimizing the parameters to improve the SVM classification accuracy and speed. We present rough set method for feature reduce and a genetic algorithm approach for feature selection and parameters optimization to solve this kind of problem. We tried Reuters 21578 using the proposed method. Experimental results indicate, compared with the traditional methods, our proposed method significantly improves the classification accuracy and has fewer input features for support vector machines.
Keywords :
classification; genetic algorithms; rough set theory; support vector machines; text analysis; Reuters 21578; classification accuracy; feature selection; feature vectors; genetic algorithm; rough set method; support vector machines; text classifier; Genetic algorithms; Instruments; Kernel; Libraries; Optimization methods; Organizing; Rough sets; Support vector machine classification; Support vector machines; Text categorization;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2874-8
DOI :
10.1109/FSKD.2007.222