DocumentCode :
2597576
Title :
Support vector machine text classification system: Using Ant Colony Optimization based feature subset selection
Author :
Mesleh, Abdelwadood Moh´d ; Kanaan, Ghassan
Author_Institution :
Fac. of Eng. Technol., Blaqa´´ Appl. Univ., Amman
fYear :
2008
fDate :
25-27 Nov. 2008
Firstpage :
143
Lastpage :
148
Abstract :
Feature subset selection (FSS) is an important step for effective text classification systems. In this work, we have implemented a support vector machine (SVM) text classifier for Arabic articles. Moreover, we have implemented a novel FSS method based on Ant Colony Optimization (ACO) and Chi-square statistic. The proposed ACO-Based FSS method adapted Chi-square statistic as heuristic information and the effectiveness of the SVM classifier as a guide to improve the selection of features for each category. Compared to the six state-of-the-art FSS methods, our ACO Based-FSS algorithm achieved better TC effectiveness. Evaluation used an in-house Arabic text classification corpus that consists of 1445 documents independently classified into nine categories. The experimental results were presented in terms of macro-averaging precision, macro-averaging recall and macro-averaging F1 measures.
Keywords :
classification; natural languages; optimisation; support vector machines; text analysis; ACO-Based FSS method; Ant Colony Optimization based feature subset selection; Arabic articles; Chi-square statistic; heuristic information; in-house Arabic text classification corpus; support vector machine text classification system; Ant colony optimization; Support vector machine classification; Support vector machines; Text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Engineering & Systems, 2008. ICCES 2008. International Conference on
Conference_Location :
Cairo
Print_ISBN :
978-1-4244-2115-2
Electronic_ISBN :
978-1-4244-2116-9
Type :
conf
DOI :
10.1109/ICCES.2008.4772984
Filename :
4772984
Link To Document :
بازگشت