Title : 
Towards the Automatic Learning of Ontologies
         
        
            Author : 
Ocampo-Guzman, Isidra ; Lopez-Arevalo, Ivan ; Tello-Leal, Edgar ; Sosa-Sosa, Victor
         
        
            Author_Institution : 
Lab. de Tecnol. de Informacion, Cinvestav-Tamaulipas, Tamaulipas, Mexico
         
        
        
        
        
        
            Abstract : 
This paper proposes a methodology for the automatic learning of ontologies from a text corpus. The concepts (topics) from documents into the corpus are identified by using the Latent Dirichlet Allocation model. Based on theset of identified topics, for each concept it is constructed its taxonomy by using the terms with greater probability which contribute to define it. WordNet is usedin the construction of these partial topic taxonomies by obtaining the similarity and relatedness between the terms that constitute each topic. The resulting taxonomies are joined to structure the final ontology. The methodology is evaluated with the Lonely Planet corpus.
         
        
            Keywords : 
Content addressable storage; Humans; Information management; Ontologies; Visualization; Latent Dirichlet Allocation; Ontology construction; WordNet;
         
        
        
        
            Conference_Titel : 
Information and Human Language Technology (STIL), 2009 Seventh Brazilian Symposium in
         
        
            Conference_Location : 
Sao Carlos, TBD, Brazil
         
        
            Print_ISBN : 
978-1-4244-6008-3
         
        
        
            DOI : 
10.1109/STIL.2009.23