Title :
Sense decomposition from E-HowNet for word similarity measurement
Author :
Cheng-Wei Shih ; Yu-Lun Hsieh ; Wen-Lian Hsu
Author_Institution :
Inst. of Inf. Sci., Taipei, Taiwan
Abstract :
In this paper, a novel Chinese word similarity measuring algorithm is proposed. It utilizes the information in Chinese dictionaries and decomposition of all the word definition in E-HowNet into semantic attributes which include hidden relationships and meanings. The extracted semantic attributes are regarded as semantic vectors for word similarity measurement. Evaluation results not only show that our word similarity measuring algorithm can achieve a higher correlation with human annotation than other methods, but also can successfully solve the unique edge distance and indistinguishability problem in taxonomy-based word similarity measurement.
Keywords :
correlation methods; dictionaries; natural language processing; string matching; text analysis; Chinese dictionaries; Chinese word similarity measuring algorithm; E-HowNet; human annotation; semantic attribute extraction; semantic attributes; sense decomposition; taxonomy-based word similarity measurement; word definition decomposition; Atmospheric measurements; Dictionaries; Extraterrestrial measurements; Feature extraction; Semantics; Taxonomy; Vectors;
Conference_Titel :
Information Reuse and Integration (IRI), 2014 IEEE 15th International Conference on
DOI :
10.1109/IRI.2014.7051946