DocumentCode :
3019144
Title :
Graph-Based Chinese Text Categorization
Author :
Wang, Zonghu ; Liu, Zhijing
Author_Institution :
Sch. of Comput. Sci. & Technol., Xidian Univ., Xi´´an, China
fYear :
2010
fDate :
25-27 June 2010
Firstpage :
1092
Lastpage :
1095
Abstract :
Most text categorization methods are based on the vector-space model which ignores the structural information, such as the order and co- occurrence of words within the text. In this paper we describe a novel approach of text classification using graph matching. To begin with, we use weight method to select the relevant features to construct the graph. Then we present an improved graph-based text representation model and describe a learning algorithm for building category graphs from the training set. Finally we use the graph matching algorithm we proposed to predict the category of the texts in the testing set. The result shows that the graph matching approach can outperform traditional vector-based NB methods in terms of both accuracy and spend time.
Keywords :
graph theory; text analysis; graph based chinese text categorization; graph matching algorithm; learning algorithm; vector space model; Classification algorithms; Computational modeling; Prediction algorithms; Testing; Text categorization; Training; Weight measurement; categorization; feature; graph; representation; text;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical and Control Engineering (ICECE), 2010 International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-6880-5
Type :
conf
DOI :
10.1109/iCECE.2010.276
Filename :
5631882
Link To Document :
بازگشت