DocumentCode :
599194
Title :
Building semantic corpus from wordNet
Author :
Stanchev, Lubomir
Author_Institution :
Indiana Univ. - Purdue Univ. Fort Wayne, Fort Wayne, IN, USA
fYear :
2012
fDate :
4-7 Oct. 2012
Firstpage :
226
Lastpage :
231
Abstract :
We propose a novel methodology for extracting semantic similarity knowledge from semi-structured sources, such as WordNet. Unlike existing approaches that only explore the structured information (e.g., the hypernym relationship in WordNet), we present a framework that allows us to utilize all available information, including natural language descriptions. Our approach constructs a semantic corpus. It is represented using a graph that models the relationship between phrases using numbers. The data in the semantic corpus can be used to measure the similarity between phrases, the similarity between documents, or to perform a semantic search in a set of documents that uses the meaning of words and phrases (i.e., search that is not keyword-based).
Keywords :
feature extraction; graph theory; natural language processing; WordNet; graph; natural language descriptions; semantic corpus; semantic search; semantic similarity knowledge extraction; semistructured sources; Bayesian methods; Correlation; Frequency conversion; Humans; Natural languages; Semantics; Wheelchairs;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine Workshops (BIBMW), 2012 IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-1-4673-2746-6
Electronic_ISBN :
978-1-4673-2744-2
Type :
conf
DOI :
10.1109/BIBMW.2012.6470308
Filename :
6470308
Link To Document :
بازگشت