DocumentCode :
1932497
Title :
A comparison of several ensemble methods for text categorization
Author :
Dong, Yan-Shi ; Han, Ke-Song
Author_Institution :
Shanghai Jiao Tong Univ., China
fYear :
2004
fDate :
15-18 Sept. 2004
Firstpage :
419
Lastpage :
422
Abstract :
Text categorization (TC), as an important domain of machine learning, has many unique traits, such as huge number of features, serious redundant features, dataset imbalance, etc. In this paper the various ensemble methods of naive Bayes classifiers and SVM classifiers are experimentally compared on the TC tasks. Besides, a new type of classifiers, moderated asymmetric naive Bayes classifiers, is proposed. Its advantages over the conventional naive Bayes classifiers in performance and computational efficiency are demonstrated.
Keywords :
belief networks; learning (artificial intelligence); pattern classification; support vector machines; text analysis; Bayes classifier; machine learning; support vector machine classifier; text categorization; Bagging; Boosting; Computational efficiency; Machine learning; Neural networks; Niobium; Stacking; Support vector machine classification; Support vector machines; Text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Services Computing, 2004. (SCC 2004). Proceedings. 2004 IEEE International Conference on
Print_ISBN :
0-7695-2225-4
Type :
conf
DOI :
10.1109/SCC.2004.1358033
Filename :
1358033
Link To Document :
بازگشت