DocumentCode :
2348111
Title :
Context-based term identification and extraction for ontology construction
Author :
Goh, Hui-Ngo ; Kiu, Ching-Chieh
Author_Institution :
Fac. of Inf. Technol., Multimedia Univ., Cyberjaya, Malaysia
fYear :
2010
fDate :
21-23 Aug. 2010
Firstpage :
1
Lastpage :
7
Abstract :
Ontology construction often requires a domain-specific corpus for conceptualizing domain knowledge; specifically, an association of terms, relations between terms, and related instances. Identifying a list of significant terms is a vital task in constructing a practical ontology. In this paper, we present a context-based term identification and extraction methodology for ontology construction from text documents. The methodology uses a taxonomy and Wikipedia to support automatic term identification and extraction from structured documents, under the assumption that candidate terms for a topic are often associated with its topic-specific keywords. The taxonomy defines a hierarchical relationship of super-topics and sub-topics, while Wikipedia provides context and background knowledge for the topics defined in the taxonomy to guide term identification and extraction. The experimental results show that the context-based term identification and extraction methodology is viable for defining topic concepts and their sub-concepts when constructing an ontology. The results also demonstrate that it remains viable for supporting ontology construction when the corpus or text size is small.
Keywords :
Web sites; document handling; ontologies (artificial intelligence); text analysis; Wikipedia; automatic term identification; context-based term identification; extraction methodology; ontology construction; text document; Games; Textiles; Transportation; Ontology construction; Taxonomy; Term identification and extraction; Wikipedia;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Natural Language Processing and Knowledge Engineering (NLP-KE), 2010 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-6896-6
Type :
conf
DOI :
10.1109/NLPKE.2010.5587801
Filename :
5587801