DocumentCode
3285864
Title
Automatic Chinese Text Classification Based on NSVMDT-KNN
Author
Xu, QiNan ; Liu, Zhijng
Author_Institution
Sch. of Comput. Sci. & Technol., Xidian Univ., Xian
Volume
2
fYear
2008
fDate
18-20 Oct. 2008
Firstpage
410
Lastpage
414
Abstract
According to this paper, a novel approach based on non-linear support vetor machine decision tree (NSVMDT) and K nearest neighbors (KNN) is proposed towards Chinese text categorization. To begin with, SVM is extended to non-linear SVM by using kernel functions. And then the method of NSVMDT is presented based on traditional SVM decision tree. Furthermore, the KNN is combined with NSVMDT to solve the problem of the categorization of unbalanced Chinese texts sets. According to this method, experimental results have shown that the hybrid method based on NSVMDT and KNN could achieve better results than traditional SVM method for Chinese text categorization.
Keywords
decision trees; support vector machines; text analysis; K nearest neighbors; NSVMDT-KNN; automatic Chinese text classification; kernel functions; nonlinear support vector machine decision tree; Decision trees; Entropy; Feature extraction; Fuzzy systems; Mutual information; Statistics; Support vector machine classification; Support vector machines; Text categorization; Text processing; Chinese text categorization; K nearest neighbors (KNN); kernel functions; non-linear support vetor machine decision tree (NSVMDT);
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
Conference_Location
Shandong
Print_ISBN
978-0-7695-3305-6
Type
conf
DOI
10.1109/FSKD.2008.289
Filename
4666149
Link To Document