شماره ركورد كنفرانس :
3237
عنوان مقاله :
Ontology Extraction from HTML Pages of one Domain
Author/Authors :
Mohammad Kadkhoda Faculty Member of Mathematics & Informatics Research Group, ACECR at T. M. U. Tehran , Mohammad Reza Pourkhani Computer Group of Institute for Higher Education ACECR Khuzestan Khuzestan
كليدواژه :
HTML parsing , content management system , ontology extraction , Semantic web , component
عنوان كنفرانس :
كنفرانس بين المللي وب پژوهي
چكيده لاتين :
Most relational databases in the web are accessiblethrough the HTML pages. This subject has been shown ontologyextraction from web relational database may be to do by parsingof these HTML pages. Especially, when we don’t have access tothe relational schema, this method is effectively doing. In thispaper, we proposed a new method for extracting ontology ofrelational databases in the web, which are managing by contentmanagement system (CMS). Our method is a reverse engineeringfor determine template of CMS such that we can to define pseudoschema of relational database and retrieval its data. Then wehave building ontology of domain as a new relational database.Moreover, HTML schema and HTML tags has been used asinformation to define ontology classes, object and dataproperties. Our method has more benefit of the mappingmethods that has transfer database to ontology, directly.