• DocumentCode
    1230759
  • Title

    An approach for measuring semantic similarity between words using multiple information sources

  • Author

    Li, Yuhua ; Bandar, Zuhair A. ; Mclean, David

  • Author_Institution
    Manchester Sch. of Eng., Manchester Univ., UK
  • Volume
    15
  • Issue
    4
  • fYear
    2003
  • Firstpage
    871
  • Lastpage
    882
  • Abstract
    Semantic similarity between words is becoming a generic problem for many applications of computational linguistics and artificial intelligence. This paper explores the determination of semantic similarity by a number of information sources, which consist of structural semantic information from a lexical taxonomy and information content from a corpus. To investigate how information sources could be used effectively, a variety of strategies for using various possible information sources are implemented. A new measure is then proposed which combines information sources nonlinearly. Experimental evaluation against a benchmark set of human similarity ratings demonstrates that the proposed measure significantly outperforms traditional similarity measures.
  • Keywords
    artificial intelligence; computational linguistics; information resources; information retrieval; natural languages; artificial intelligence; computational linguistics; experimental evaluation; human similarity ratings; information sources; lexical database; lexical taxonomy; multiple information sources; natural language; structural semantic information; word semantic similarity measurement; Artificial intelligence; Computational linguistics; Databases; Helium; Humans; Image retrieval; Joining processes; Natural language processing; Statistics; Taxonomy;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2003.1209005
  • Filename
    1209005