• DocumentCode
    3570929
  • Title

    Sense decomposition from E-HowNet for word similarity measurement

  • Author

    Cheng-Wei Shih ; Yu-Lun Hsieh ; Wen-Lian Hsu

  • Author_Institution
    Inst. of Inf. Sci., Taipei, Taiwan
  • fYear
    2014
  • Firstpage
    613
  • Lastpage
    618
  • Abstract
    In this paper, a novel Chinese word similarity measuring algorithm is proposed. It utilizes the information in Chinese dictionaries and decomposition of all the word definition in E-HowNet into semantic attributes which include hidden relationships and meanings. The extracted semantic attributes are regarded as semantic vectors for word similarity measurement. Evaluation results not only show that our word similarity measuring algorithm can achieve a higher correlation with human annotation than other methods, but also can successfully solve the unique edge distance and indistinguishability problem in taxonomy-based word similarity measurement.
  • Keywords
    correlation methods; dictionaries; natural language processing; string matching; text analysis; Chinese dictionaries; Chinese word similarity measuring algorithm; E-HowNet; human annotation; semantic attribute extraction; semantic attributes; sense decomposition; taxonomy-based word similarity measurement; word definition decomposition; Atmospheric measurements; Dictionaries; Extraterrestrial measurements; Feature extraction; Semantics; Taxonomy; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Reuse and Integration (IRI), 2014 IEEE 15th International Conference on
  • Type

    conf

  • DOI
    10.1109/IRI.2014.7051946
  • Filename
    7051946