Title :
A Graph-Structure-Based Method for Chinese Document Representation towards Clustering Application
Author :
Liu, Qiaofeng ; Wu, Jiangning ; Wang, Yonggui
Author_Institution :
Inst. of Syst. Eng., Dalian Univ. of Technol., Dalian
Abstract :
In this paper, we propose a graph-structure-based method to represent knowledge for Chinese document clustering. First, we introduce a new knowledge representation method called Graph Space Model (GSM) to convert each document to a graph structure, and then we adopt Maximum Common Subgraph (MCS) to compute the similarities between any two graph structures, which can be further used for document clustering. The results show that the GSM approach can outperform VSM method in representing capability of Chinese documents.
Keywords :
document handling; graph theory; knowledge representation; natural language processing; pattern clustering; Chinese document clustering; graph space model; graph structure; knowledge representation; maximum common subgraph; Computer aided software engineering; Frequency measurement; GSM; Knowledge representation; Systems engineering and theory;
Conference_Titel :
Wireless Communications, Networking and Mobile Computing, 2008. WiCOM '08. 4th International Conference on
Conference_Location :
Dalian
Print_ISBN :
978-1-4244-2107-7
Electronic_ISBN :
978-1-4244-2108-4
DOI :
10.1109/WiCom.2008.1195