DocumentCode :
2727988
Title :
Discovering Subsumption Hierarchies of Ontology Concepts from Text Corpora
Author :
Zavitsanos, E. ; Paliouras, G. ; Vouros, G.A.
Author_Institution :
Inst. of Informatics & Telecommun., Aghia Paraskevi
fYear :
2007
fDate :
2-5 Nov. 2007
Firstpage :
402
Lastpage :
408
Abstract :
This paper proposes a method for learning ontologies given a corpus of text documents. The method identifies concepts in documents and organizes them into a subsumption hierarchy, without presupposing the existence of a seed ontology. The method uncovers latent topics in terms of which document text is being generated. These topics form the concepts of the new ontology. This is done in a language neutral way, using probabilistic space reduction techniques over the original term space of the corpus. Given multiple sets of concepts (latent topics) being discovered, the proposed method constructs a subsumption hierarchy by performing conditional independence tests among pairs of latent topics, given a third one. The paper provides experimental results over the GENIA corpus from the domain of biomedicine.
Keywords :
data mining; learning (artificial intelligence); ontologies (artificial intelligence); text analysis; latent topics; ontologies learning; ontology concepts; probabilistic space reduction; seed ontology; subsumption hierarchies dicovery; text corpora; text documents; Biomedical engineering; Humans; Informatics; Linear discriminant analysis; Machine learning; Ontologies; Performance evaluation; Shape; Systems engineering and theory; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence, IEEE/WIC/ACM International Conference on
Conference_Location :
Fremont, CA
Print_ISBN :
978-0-7695-3026-0
Type :
conf
DOI :
10.1109/WI.2007.55
Filename :
4427123
Link To Document :
بازگشت