DocumentCode :
3570929
Title :
Sense decomposition from E-HowNet for word similarity measurement
Author :
Cheng-Wei Shih ; Yu-Lun Hsieh ; Wen-Lian Hsu
Author_Institution :
Inst. of Inf. Sci., Taipei, Taiwan
fYear :
2014
Firstpage :
613
Lastpage :
618
Abstract :
In this paper, a novel Chinese word similarity measuring algorithm is proposed. It utilizes the information in Chinese dictionaries and decomposition of all the word definition in E-HowNet into semantic attributes which include hidden relationships and meanings. The extracted semantic attributes are regarded as semantic vectors for word similarity measurement. Evaluation results not only show that our word similarity measuring algorithm can achieve a higher correlation with human annotation than other methods, but also can successfully solve the unique edge distance and indistinguishability problem in taxonomy-based word similarity measurement.
Keywords :
correlation methods; dictionaries; natural language processing; string matching; text analysis; Chinese dictionaries; Chinese word similarity measuring algorithm; E-HowNet; human annotation; semantic attribute extraction; semantic attributes; sense decomposition; taxonomy-based word similarity measurement; word definition decomposition; Atmospheric measurements; Dictionaries; Extraterrestrial measurements; Feature extraction; Semantics; Taxonomy; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Reuse and Integration (IRI), 2014 IEEE 15th International Conference on
Type :
conf
DOI :
10.1109/IRI.2014.7051946
Filename :
7051946
Link To Document :
بازگشت