DocumentCode :
673009
Title :
A Hybrid Text Classification Method Based on K-Congener-Nearest-Neighbors and Hypersphere Support Vector Machine
Author :
Chen, Y.H. ; Zheng, Yuan F. ; Pan, J.F. ; Yang, Nan
Author_Institution :
Sch. of Inf., Zhejiang Univ. of Finance & Econ., Hangzhou, China
fYear :
2013
fDate :
16-17 Nov. 2013
Firstpage :
493
Lastpage :
497
Abstract :
Our work implements a novel text classifier by combining k congener nearest neighbors-Support Vector Machine(KCNN-SVM) with hyper sphere Support Vector Machine(hyper sphere-SVM) training algorithm. Hyper plane Support Vector Machine has been widely used to divide the samples into two equal categories. However, the hyper sphere Support Vector Machine can not only separate the samples, but also divide them into two different parts. Since the probability inside and outside the hyper sphere is not same, hyper sphere-SVM is helpful to the classification when the datasets are imbalanced that we can control the radius of hyper sphere to get higher accuracy. The KCNN-SVM algorithm distinguishes a sample with its nearest neighbor´s category label as well as the average distance between it and its k nearest same kind of neighbors which can enhance the accuracy when the samples are chaotic imbalanced. In this paper, we propose the hyper sphere-KCNN-SVM(HS-KCNN-SVM) hybrid approach which can validly improve the classification accuracy especially for those chaotic imbalanced samples.
Keywords :
chaos; pattern classification; support vector machines; text analysis; HS-KCNN-SVM; chaotic imbalanced samples; hybrid text classification method; hyperplane support vector machine; hypersphere radius; hypersphere support vector machine; hypersphere-KCNN-SVM hybrid approach; hypersphere-SVM; k congener nearest neighbors-support vector machine; nearest neighbor category label; text classifier; Accuracy; Classification algorithms; Educational institutions; Support vector machine classification; Text categorization; Training; HS-KCNN-SVM; Support Vector Machine; hypersphere; k congener nearest neighbors; text classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology and Applications (ITA), 2013 International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4799-2876-7
Type :
conf
DOI :
10.1109/ITA.2013.120
Filename :
6710036
Link To Document :
بازگشت