Title :
An Optimal Text Categorization Algorithm Based on SVM
Author :
Wang, Ziqiang ; Sun, Xia ; Zhang, Dexian
Author_Institution :
Sch. of Inf. & Eng., Henan Univ. of Technol., Zheng Zhou
Abstract :
Text categorization is the process of assigning documents to a set of previously fixed categories. In this paper we develop an optimal SVM algorithm for text classification via multiple optimal strategies, such as a novel importance weight definition, the feature selection using the likelihood ratio for binomial distribution, the optimal parameter settings, etc. Comparison between our method and other conventional text classification algorithms is conducted on Reuter and TREC corpora. The experimental results indicate that our proposed algorithm yields much better performance than other conventional algorithms
Keywords :
binomial distribution; support vector machines; text analysis; Reuter; SVM; TREC corpora; binomial distribution; documents assignment; likelihood ratio; text categorization algorithm; Artificial intelligence; Classification algorithms; Frequency; Organizing; Routing; Statistical analysis; Sun; Support vector machine classification; Support vector machines; Text categorization;
Conference_Titel :
Communications, Circuits and Systems Proceedings, 2006 International Conference on
Conference_Location :
Guilin
Print_ISBN :
0-7803-9584-0
Electronic_ISBN :
0-7803-9585-9
DOI :
10.1109/ICCCAS.2006.284921