Title :
Graph-Based Chinese Text Categorization
Author :
Wang, Zonghu ; Liu, Zhijing
Author_Institution :
Sch. of Comput. Sci. & Technol., Xidian Univ., Xi´´an, China
Abstract :
Most text categorization methods are based on the vector-space model which ignores the structural information, such as the order and co- occurrence of words within the text. In this paper we describe a novel approach of text classification using graph matching. To begin with, we use weight method to select the relevant features to construct the graph. Then we present an improved graph-based text representation model and describe a learning algorithm for building category graphs from the training set. Finally we use the graph matching algorithm we proposed to predict the category of the texts in the testing set. The result shows that the graph matching approach can outperform traditional vector-based NB methods in terms of both accuracy and spend time.
Keywords :
graph theory; text analysis; graph based chinese text categorization; graph matching algorithm; learning algorithm; vector space model; Classification algorithms; Computational modeling; Prediction algorithms; Testing; Text categorization; Training; Weight measurement; categorization; feature; graph; representation; text;
Conference_Titel :
Electrical and Control Engineering (ICECE), 2010 International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-6880-5
DOI :
10.1109/iCECE.2010.276