Title :
OntoBuilder: fully automatic extraction and consolidation of ontologies from Web sources
Author :
Gal, Avigdor ; Modica, Giovanni ; Jamil, Hasan
Author_Institution :
Technion-Israel Inst. of Technol., Haifa, Israel
fDate :
30 March-2 April 2004
Abstract :
Ontologies, formal specifications of domains, have evolved in recent years as a leading tool in representing and interpreting Web data. The OntoBuilder project supports the extraction of ontologies from Web search interfaces, ranging from simple search engine forms to multiple-pages, complex reservation systems. OntoBuilder enables fully-automatic ontology matching. The use of ontologies, as opposed to relational schema or XML, as an underlying data model allows a flexible representation of metadata, that can be tailored to many different types of applications. OntoBuilder was developed using Java, which makes it portable to various platforms and operating system environments. We demonstrate OntoBuilder using an easy-to-follow example of matching car rental ontologies. The system creates ontologies of car rental Web sites on-the-fly, and combine them into a global ontology. The benefits of OntoBuilder in resolving, in an automatic manner, semantic heterogeneity, including synonyms and designer errors are highlighted.
Keywords :
Internet; Web sites; XML; information retrieval; meta data; pattern matching; search engines; Internet; Java; OntoBuilder project; Web data; Web search interfaces; Web sources; XML; car rental Web sites; car rental ontology matching; data model; formal specifications; fully automatic ontology extraction; fully-automatic ontology matching; metadata; search engine; semantic heterogeneity; Data mining; Data models; Filling; Formal specifications; Java; Ontologies; Operating systems; Search engines; Web search; XML;
Conference_Titel :
Data Engineering, 2004. Proceedings. 20th International Conference on
Print_ISBN :
0-7695-2065-0
DOI :
10.1109/ICDE.2004.1320082