DocumentCode
3570929
Title
Sense decomposition from E-HowNet for word similarity measurement
Author
Cheng-Wei Shih ; Yu-Lun Hsieh ; Wen-Lian Hsu
Author_Institution
Inst. of Inf. Sci., Taipei, Taiwan
fYear
2014
Firstpage
613
Lastpage
618
Abstract
In this paper, a novel Chinese word similarity measuring algorithm is proposed. It utilizes the information in Chinese dictionaries and decomposition of all the word definition in E-HowNet into semantic attributes which include hidden relationships and meanings. The extracted semantic attributes are regarded as semantic vectors for word similarity measurement. Evaluation results not only show that our word similarity measuring algorithm can achieve a higher correlation with human annotation than other methods, but also can successfully solve the unique edge distance and indistinguishability problem in taxonomy-based word similarity measurement.
Keywords
correlation methods; dictionaries; natural language processing; string matching; text analysis; Chinese dictionaries; Chinese word similarity measuring algorithm; E-HowNet; human annotation; semantic attribute extraction; semantic attributes; sense decomposition; taxonomy-based word similarity measurement; word definition decomposition; Atmospheric measurements; Dictionaries; Extraterrestrial measurements; Feature extraction; Semantics; Taxonomy; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Reuse and Integration (IRI), 2014 IEEE 15th International Conference on
Type
conf
DOI
10.1109/IRI.2014.7051946
Filename
7051946
Link To Document