Title :
Graph-based KNN text classification
Author :
Wang, Zonghu ; Liu, Zhijing
Author_Institution :
Sch. of Comput. Sci. & Technol., Xidian Univ., Xi´´an, China
Abstract :
Vector space model is used in most text categorization methods without considering the important information such as the order and co- occurrence of words within the text. In this paper we describe a novel approach of text classification using graph-based KNN. We reduce the number of features dimensions by a combined feature selection method. Then we present an improved graph-based text representation model and describe a novel graph-based KNN algorithm to predict the category of the texts in the testing set. The result shows that our approach can outperform traditional VSM-based KNN methods in terms of both accuracy and cost time.
Keywords :
graph theory; pattern classification; text analysis; KNN text classification; feature selection method; graph-based KNN; k-nearest neighbor; text categorization methods; vector space model; Classification algorithms; Computational modeling; Mutual information; Support vector machine classification; Testing; Text categorization; Training; KNN; categorization; feature; graph; text;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
Conference_Location :
Yantai, Shandong
Print_ISBN :
978-1-4244-5931-5
DOI :
10.1109/FSKD.2010.5569866