• DocumentCode
    2597576
  • Title

    Support vector machine text classification system: Using Ant Colony Optimization based feature subset selection

  • Author

    Mesleh, Abdelwadood Moh´d ; Kanaan, Ghassan

  • Author_Institution
    Fac. of Eng. Technol., Blaqa´´ Appl. Univ., Amman
  • fYear
    2008
  • fDate
    25-27 Nov. 2008
  • Firstpage
    143
  • Lastpage
    148
  • Abstract
    Feature subset selection (FSS) is an important step for effective text classification systems. In this work, we have implemented a support vector machine (SVM) text classifier for Arabic articles. Moreover, we have implemented a novel FSS method based on Ant Colony Optimization (ACO) and Chi-square statistic. The proposed ACO-Based FSS method adapted Chi-square statistic as heuristic information and the effectiveness of the SVM classifier as a guide to improve the selection of features for each category. Compared to the six state-of-the-art FSS methods, our ACO Based-FSS algorithm achieved better TC effectiveness. Evaluation used an in-house Arabic text classification corpus that consists of 1445 documents independently classified into nine categories. The experimental results were presented in terms of macro-averaging precision, macro-averaging recall and macro-averaging F1 measures.
  • Keywords
    classification; natural languages; optimisation; support vector machines; text analysis; ACO-Based FSS method; Ant Colony Optimization based feature subset selection; Arabic articles; Chi-square statistic; heuristic information; in-house Arabic text classification corpus; support vector machine text classification system; Ant colony optimization; Support vector machine classification; Support vector machines; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Engineering & Systems, 2008. ICCES 2008. International Conference on
  • Conference_Location
    Cairo
  • Print_ISBN
    978-1-4244-2115-2
  • Electronic_ISBN
    978-1-4244-2116-9
  • Type

    conf

  • DOI
    10.1109/ICCES.2008.4772984
  • Filename
    4772984