Title :
Towards the Automatic Learning of Ontologies
Author :
Ocampo-Guzman, Isidra ; Lopez-Arevalo, Ivan ; Tello-Leal, Edgar ; Sosa-Sosa, Victor
Author_Institution :
Lab. de Tecnol. de Informacion, Cinvestav-Tamaulipas, Tamaulipas, Mexico
Abstract :
This paper proposes a methodology for the automatic learning of ontologies from a text corpus. The concepts (topics) in the documents of the corpus are identified using the Latent Dirichlet Allocation model. From the set of identified topics, a taxonomy is constructed for each concept using the highest-probability terms that contribute to defining it. WordNet is used in the construction of these partial topic taxonomies to obtain the similarity and relatedness between the terms that constitute each topic. The resulting taxonomies are joined to form the final ontology. The methodology is evaluated on the Lonely Planet corpus.
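The pipeline outlined in the abstract (topics, then top terms per topic, then partial taxonomies, then a joined ontology) can be illustrated with a minimal sketch. This is not the authors' implementation. The topic-term probabilities in `TOPICS` are invented stand-ins for Latent Dirichlet Allocation output, the `HYPERNYM` table is a toy substitute for WordNet lookups, and grouping terms under a shared parent is a simplification of the similarity and relatedness measures the abstract mentions. All function names are hypothetical.

```python
from typing import Dict, Iterable, List

# Assumption: hand-written topic-term probabilities standing in for LDA output.
TOPICS: Dict[str, Dict[str, float]] = {
    "topic_0": {"hotel": 0.30, "hostel": 0.25, "beach": 0.05, "room": 0.20},
    "topic_1": {"beach": 0.35, "island": 0.30, "hotel": 0.04, "coast": 0.15},
}

# Assumption: toy parent table standing in for WordNet; not real WordNet data.
HYPERNYM: Dict[str, str] = {
    "hotel": "building",
    "hostel": "building",
    "room": "area",
    "beach": "geological_formation",
    "island": "land",
    "coast": "geological_formation",
}


def top_terms(term_probs: Dict[str, float], k: int) -> List[str]:
    """Return the k terms with the highest probability (ties broken alphabetically)."""
    ranked = sorted(term_probs.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


def topic_taxonomy(terms: Iterable[str]) -> Dict[str, List[str]]:
    """Build a partial taxonomy for one topic by grouping terms under a parent."""
    taxonomy: Dict[str, List[str]] = {}
    for term in terms:
        parent = HYPERNYM.get(term, "entity")  # fall back to a generic root
        taxonomy.setdefault(parent, []).append(term)
    return taxonomy


def join_taxonomies(taxonomies: Iterable[Dict[str, List[str]]]) -> Dict[str, List[str]]:
    """Merge partial taxonomies, keeping each child once per parent."""
    merged: Dict[str, List[str]] = {}
    for taxonomy in taxonomies:
        for parent, children in taxonomy.items():
            bucket = merged.setdefault(parent, [])
            for child in children:
                if child not in bucket:
                    bucket.append(child)
    return merged


def build_ontology(topics: Dict[str, Dict[str, float]], k: int = 3) -> Dict[str, List[str]]:
    """Run the three steps: top terms per topic, partial taxonomies, joined result."""
    partial = [topic_taxonomy(top_terms(probs, k)) for probs in topics.values()]
    return join_taxonomies(partial)


if __name__ == "__main__":
    for parent, children in build_ontology(TOPICS).items():
        print(parent, "->", children)
```

In a fuller version, the topic-term table would come from a trained topic model and the parent lookup from a lexical database; both are replaced by fixed dictionaries here so the sketch stays self-contained.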
Keywords :
Content addressable storage; Humans; Information management; Ontologies; Visualization; Latent Dirichlet Allocation; Ontology construction; WordNet;
Conference_Title :
Information and Human Language Technology (STIL), 2009 Seventh Brazilian Symposium in
Conference_Location :
Sao Carlos, Brazil
Print_ISBN :
978-1-4244-6008-3
DOI :
10.1109/STIL.2009.23