DocumentCode :
2664351
Title :
Multi-senses and multi-dependencies discovery among words
Author :
Jie, Tang ; JuanZi, Li ; Ke-Hong, Wang ; Yue-Ru, Cai
Author_Institution :
Dept. of Comput., Tsinghua Univ., Beijing, China
fYear :
2003
fDate :
26-29 Oct. 2003
Firstpage :
132
Lastpage :
137
Abstract :
Word sense and word dependency benefit many applications. Manually constructed lexicon usually serves as a source for word sense and word dependency. However, there always requires very expensive work and extensive time consumption to compile it, simultaneously senses and relationships among words in these kinds of lexica are always missed. Automatically compiled lexica are under developing, the main obstacle is to discover multisenses and multidependencies among words. We propose a new method combining an improved ISODATA clustering algorithm with association rule mining to answer the question. With the recursively clustering algorithm lower frequency senses are discovered. As well a approach for refinement is put forward to improve the precision. Experiments indicate that the approach presented here provides preferable outputs.
Keywords :
computational linguistics; data mining; pattern clustering; text analysis; word processing; ISODATA clustering algorithm; association rule mining; recursive clustering algorithm; word dependency; word sense; Application software; Association rules; Clustering algorithms; Data mining; Frequency; Knowledge engineering; Machine intelligence; Machine learning algorithms; Ontologies; Software tools;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
Conference_Location :
Beijing, China
Print_ISBN :
0-7803-7902-0
Type :
conf
DOI :
10.1109/NLPKE.2003.1275883
Filename :
1275883
Link To Document :
بازگشت