DocumentCode :
182984
Title :
A novel feature voting model for text classification
Author :
Sen Jia ; Jinquan Liang ; Yao Xie ; Lin Deng
Author_Institution :
Key Lab. of Spatial Inf. Intell. Perception & Services, Shenzhen Univ., Shenzhen, China
fYear :
2014
fDate :
19-21 Aug. 2014
Firstpage :
306
Lastpage :
311
Abstract :
Along with the information explosion in the Internet era, the traditional classification methods, such as KNN (k-nearest neighbor), Naive Bayes (NB), encounter bottlenecks due to the endless stream of new words. In this paper, through comparing with the Rocchio and Bayesian algorithms, it has been found that centroid-based algorithms are insufficient for text classification. Therefore, a novel feature voting model is proposed, which gives rise to a bag-of-words based feature voting algorithm for text classification. This algorithm assigns categories for each document according to the ranking of weighted sum of feature values. Experimental results have shown the efficiency of the proposed method over the other state-of-the-art methods.
Keywords :
Bayes methods; pattern classification; text analysis; Bayesian algorithm; Internet; KNN; NB; Rocchio algorithm; bag-of-words based feature voting algorithm; centroid-based algorithms; information explosion; k-nearest neighbor; naive Bayes; text classification; Accuracy; Classification algorithms; Equations; Internet; Mathematical model; Training; Vectors; Naive Bayes; feature voting; text classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2014 11th International Conference on
Conference_Location :
Xiamen
Print_ISBN :
978-1-4799-5147-5
Type :
conf
DOI :
10.1109/FSKD.2014.6980851
Filename :
6980851
Link To Document :
بازگشت