Title :
An automatic ontology population with a machine learning technique from semi-structured documents
Author :
Song, Hyun-Je ; Park, Seong-Bae ; Park, Se-Young
Author_Institution :
Dept. of Comput. Eng., Kyungpook Nat. Univ., Daegu, South Korea
Abstract :
The manual design of an ontology usually defines the concepts for the domain, but the individual instances of the concepts are often missing though they are important in using the ontology as a knowledge base. This is due to high cost of the manual construction of individuals. In order to tackle this problem, this paper proposes an automatic method for ontology population. The knowledge source for ontology population used in this paper is the Web tables of which structure is relatively well organized. Since a Web table can be analyzed into a parse tree, the most appropriate concept within the ontology for a given Web table is determined by a kernel method, so-called a parse tree kernel. Then, the table is populated as an individual of the concept. According to the experimental results on a large ontology with a great number of concepts, the proposed method achieves 62.35% of accuracy for a number of Web tables.
Keywords :
grammars; learning (artificial intelligence); ontologies (artificial intelligence); Web tables; automatic ontology population; machine learning; parse tree kernel; semi-structured documents; Data mining; HTML; Humans; Kernel; Machine learning; Ontologies; Semantic Web; Tree graphs; Web pages; XML;
Conference_Titel :
Information and Automation, 2009. ICIA '09. International Conference on
Conference_Location :
Zhuhai, Macau
Print_ISBN :
978-1-4244-3607-1
Electronic_ISBN :
978-1-4244-3608-8
DOI :
10.1109/ICINFA.2009.5204981