• DocumentCode
    3138125
  • Title

    A New Model of Information Content for Semantic Similarity in WordNet

  • Author

    Zhou, Zili ; Wang, Yanna ; Gu, Junzhong

  • Author_Institution
    Coll. of Phys. & Eng., Qufu Normal Univ., Qufu
  • Volume
    3
  • fYear
    2008
  • fDate
    13-15 Dec. 2008
  • Firstpage
    85
  • Lastpage
    89
  • Abstract
    Information Content (IC) is an important dimension of assessing the semantic similarity between two terms or word senses in word knowledge. The conventional method of obtaining IC of word senses is to combine knowledge of their hierarchical structure from an ontology like WordNet with actual usage in text as derived from a large corpus. In this paper, a new model of IC is presented, which relies on hierarchical structure alone. The model considers not only the hyponyms of each word sense but also its depth in the structure. The IC value is easier to calculate based on our model, and when used as the basis of a similarity approach it yields judgments that correlate more closely with human assessments than others, which using IC value obtained only considering the hyponyms and IC value got by employing corpus analysis.
  • Keywords
    information retrieval; ontologies (artificial intelligence); WordNet; corpus analysis; human assessments; information content; semantic similarity; word knowledge; Artificial intelligence; Computer science; Conferences; Educational institutions; History; Humans; Integrated circuit modeling; Knowledge engineering; Ontologies; Physics; Information Content; Semantic Similarity; WordNet;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Future Generation Communication and Networking Symposia, 2008. FGCNS '08. Second International Conference on
  • Conference_Location
    Sanya
  • Print_ISBN
    978-1-4244-3430-5
  • Electronic_ISBN
    978-0-7695-3546-3
  • Type

    conf

  • DOI
    10.1109/FGCNS.2008.16
  • Filename
    4813554