DocumentCode :
3568283
Title :
Dimension reduction with coevolutionary genetic algorithm for text classification
Author :
Gasanova, Tatiana ; Sergienko, Roman ; Semenkin, Eugene ; Minker, Wolfgang
Author_Institution :
Institute of Communications Engineering, Ulm University, Albert Einstein-Allee 43, 89081, Germany
Volume :
1
fYear :
2014
Firstpage :
215
Lastpage :
222
Abstract :
Text classification of large-size corpora is time-consuming for implementation of classification algorithms. For this reason, it is important to reduce dimension of text classification problems. We propose a method for dimension reduction based on hierarchical agglomerative clustering of terms and cluster weight optimization using cooperative coevolutionary genetic algorithm. The method was applied on 5 different corpora using several classification methods with different text preprocessing. The method reduces dimension of text classification problem significantly. Classification efficiency increases or decreases non-significantly after clustering with optimization of cluster weights.
Keywords :
Clustering algorithms; Dictionaries; Games; Genetic algorithms; Optimization; Support vector machines; Vocabulary; Clustering; Coevolutionary Algorithm; Dimension Reduction; Text Classification; Text Preprocessing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Informatics in Control, Automation and Robotics (ICINCO), 2014 11th International Conference on
Type :
conf
Filename :
7049774
Link To Document :
بازگشت