Title :
Multi-senses and multi-dependencies discovery among words
Author :
Jie, Tang ; JuanZi, Li ; Ke-Hong, Wang ; Yue-Ru, Cai
Author_Institution :
Dept. of Comput., Tsinghua Univ., Beijing, China
Abstract :
Word sense and word dependency benefit many applications. Manually constructed lexicon usually serves as a source for word sense and word dependency. However, there always requires very expensive work and extensive time consumption to compile it, simultaneously senses and relationships among words in these kinds of lexica are always missed. Automatically compiled lexica are under developing, the main obstacle is to discover multisenses and multidependencies among words. We propose a new method combining an improved ISODATA clustering algorithm with association rule mining to answer the question. With the recursively clustering algorithm lower frequency senses are discovered. As well a approach for refinement is put forward to improve the precision. Experiments indicate that the approach presented here provides preferable outputs.
Keywords :
computational linguistics; data mining; pattern clustering; text analysis; word processing; ISODATA clustering algorithm; association rule mining; recursive clustering algorithm; word dependency; word sense; Application software; Association rules; Clustering algorithms; Data mining; Frequency; Knowledge engineering; Machine intelligence; Machine learning algorithms; Ontologies; Software tools;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
Conference_Location :
Beijing, China
Print_ISBN :
0-7803-7902-0
DOI :
10.1109/NLPKE.2003.1275883