Title :
Chinese Keywords Clustering Based on SOM
Author :
Wang, Yi ; Jin, Hu
Author_Institution :
ChengDu Univ. of Inf. Technol., Chengdu
Abstract :
Keyword clustering is useful for text information retrieval, text document classification and so on. This paper introduces an unsupervised method to cluster Chinese keyword by the artificial neural network of SOM (self-organized map). Keywords are encoded into numeric vectors by the similarities of their contextual word sets, which are composed by their neighbor words in the range of phrases. The experimental result shows that words can be clustered on the map according to both of their syntactic and semantic features.
Keywords :
information retrieval; natural language processing; pattern clustering; self-organising feature maps; text analysis; Chinese keywords clustering; document classification; self-organized map; text information retrieval; Artificial neural networks; Clustering algorithms; Clustering methods; Frequency; Information retrieval; Information technology; Natural languages; Neural networks; Neurons; Text categorization; self-organized map; unsupervised machine learning; word clustering;
Conference_Titel :
Natural Computation, 2008. ICNC '08. Fourth International Conference on
Print_ISBN :
978-0-7695-3304-9
DOI :
10.1109/ICNC.2008.927