DocumentCode
3138125
Title
A New Model of Information Content for Semantic Similarity in WordNet
Author
Zhou, Zili ; Wang, Yanna ; Gu, Junzhong
Author_Institution
Coll. of Phys. & Eng., Qufu Normal Univ., Qufu
Volume
3
fYear
2008
fDate
13-15 Dec. 2008
Firstpage
85
Lastpage
89
Abstract
Information Content (IC) is an important dimension of assessing the semantic similarity between two terms or word senses in word knowledge. The conventional method of obtaining IC of word senses is to combine knowledge of their hierarchical structure from an ontology like WordNet with actual usage in text as derived from a large corpus. In this paper, a new model of IC is presented, which relies on hierarchical structure alone. The model considers not only the hyponyms of each word sense but also its depth in the structure. The IC value is easier to calculate based on our model, and when used as the basis of a similarity approach it yields judgments that correlate more closely with human assessments than others, which using IC value obtained only considering the hyponyms and IC value got by employing corpus analysis.
Keywords
information retrieval; ontologies (artificial intelligence); WordNet; corpus analysis; human assessments; information content; semantic similarity; word knowledge; Artificial intelligence; Computer science; Conferences; Educational institutions; History; Humans; Integrated circuit modeling; Knowledge engineering; Ontologies; Physics; Information Content; Semantic Similarity; WordNet;
fLanguage
English
Publisher
ieee
Conference_Titel
Future Generation Communication and Networking Symposia, 2008. FGCNS '08. Second International Conference on
Conference_Location
Sanya
Print_ISBN
978-1-4244-3430-5
Electronic_ISBN
978-0-7695-3546-3
Type
conf
DOI
10.1109/FGCNS.2008.16
Filename
4813554
Link To Document