DocumentCode :
3274660
Title :
Research of Text Categorization on WEKA
Author :
Li Dan ; Liu Lihua ; Zhang Zhaoxin
Author_Institution :
Hebei Software Inst., Baoding, China
fYear :
2013
fDate :
16-18 Jan. 2013
Firstpage :
1129
Lastpage :
1131
Abstract :
The choice of algorithm is a key text categorization problem. In order to evaluation synthetically, analyzed three popular text categorization algorithm that are naive Bayes (NB), decision tree(DT) and support vector machines(SVM). Carried on simulation experiment used the open source data mining tool of Weka. Experimental results show some significant conclusions: The performance of three classification methods are better, including Support vector machine classification of the best performance, highest precision and recall, naive Bayes second, the minimum Decision tree. Also found that classification performance associated not only the choice of the classification algorithm but also the differences between corpus categories.
Keywords :
Bayes methods; decision trees; learning (artificial intelligence); pattern classification; support vector machines; text analysis; DT; NB; SVM; WEKA; classification methods; corpus categories; decision tree; naive Bayes; support vector machines; text categorization; Algorithm design and analysis; Classification algorithms; Decision trees; Kernel; Support vector machine classification; Text categorization; Decision tree; Naive bayes; Support vector machines; Text categorization; Weka;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent System Design and Engineering Applications (ISDEA), 2013 Third International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4673-4893-5
Type :
conf
DOI :
10.1109/ISDEA.2012.266
Filename :
6455773
Link To Document :
بازگشت