شماره ركورد كنفرانس :
922
عنوان مقاله :
Ontology Extraction from HTML Pages of one Domain
پديدآورندگان :
Kadkhoda Mohammad نويسنده , Pourkhani Mohammad Reza نويسنده
كليدواژه :
Semantic Web , Ontology extraction , content management system , HTML parsing
عنوان كنفرانس :
مجموعه مقالات اولين كنفرانس بين المللي وب پژوهي
چكيده فارسي :
Most relational databases in the web are accessible
through the HTML pages. This subject has been shown ontology
extraction from web relational database may be to do by parsing
of these HTML pages. Especially, when we don’t have access to
the relational schema, this method is effectively doing. In this
paper, we proposed a new method for extracting ontology of
relational databases in the web, which are managing by content
management system (CMS). Our method is a reverse engineering
for determine template of CMS such that we can to define pseudo
schema of relational database and retrieval its data. Then we
have building ontology of domain as a new relational database.
Moreover, HTML schema and HTML tags has been used as
information to define ontology classes, object and data
properties. Our method has more benefit of the mapping
methods that has transfer database to ontology, directly.
شماره مدرك كنفرانس :
3967648