Title :
Keyword Combination Extraction in Text Categorization Based on Ant Colony Optimization
Author :
Yu, Zi-jun ; Wu, Wei-gang ; Xiao, Jing ; Zhang, Jun ; Huang, Rui-Zhang ; Liu, Ou
Author_Institution :
Dept. of Comput. Sci., SUN yat-sen Uninversity, Guangzhou, China
Abstract :
Due to the increasing number of documents in digital form, the automated text categorization (TC) has become more and more promising in the last ten years. A TC system can automatically assign a document with the most suitable category, but the reason for such an assignment is usually unknown by users. To make the TC system be interpretable, it is necessary to select a group of keywords, or termed a keyword combination, to describe each text category. In this paper, we propose a novel algorithm, keyword combination extraction based on ant colony optimization (KCEACO), to search the optimal keyword combination of a target category. By extending the traditional feature selection techniques, an evaluation function is designed for evaluating a keyword combination. This function takes into account the relationships among different keywords. Experimental results show that KCEACO can efficiently find the optimal keyword combination from a large number of candidate combinations.
Keywords :
feature extraction; optimisation; text analysis; automated text categorization; feature selection techniques; keyword combination extraction based on ant colony optimization; optimal keyword combination; Ant colony optimization; Computer science; Data mining; Electronic publishing; Information retrieval; Pattern recognition; Software libraries; Sun; Support vector machines; Text categorization; ant colony optimization; concept learning; feature selection; keyword combination extraction; text categorization;
Conference_Titel :
Soft Computing and Pattern Recognition, 2009. SOCPAR '09. International Conference of
Conference_Location :
Malacca
Print_ISBN :
978-1-4244-5330-6
Electronic_ISBN :
978-0-7695-3879-2
DOI :
10.1109/SoCPaR.2009.90