Title :
An efficient wrapper generation in DIMS
Author_Institution :
CNRS, Univ. de Valenciennes et du Hainaut Cambresis, France
Abstract :
We discuss the problem of distributed, autonomous and heterogeneous Web data sources wrapper, describe an efficient innovate wrapper generation for access, retrieve these information. Our approach is an XML-based methodology whose goal extends for beyond simple "screen scrapping". Our system architecture is based on famous mediator-wrapper architecture. We use some efficient tools, such as Fatdog\´s XQEngine, Jarkarta Lucene, Jtidy, JDBC or JDBC-ODBC bridge drivers, to facile wrapper heterogeneous Web sources into standard XML data, and translate the user\´s query to related wrapper using mediator and integrate these results. For demonstration, we develop distributed information management system (DIMS) framework that is an XML-based, mediator-wrapper architecture information integration system for accessing these Web sources. An approach for DIMS wrapper generation is mainly discussed.
Keywords :
Internet; XML; information retrieval; Fatdog XQEngine tool; JDBC-ODBC bridge drivers tool; Jarkarta Lucene tool; Jtidy tool; Web data sources wrapper; XML; XML-based mediator-wrapper architecture; distributed information management system; information extraction; information integration system; screen scrapping; Bridges; Data mining; Data models; HTML; Information management; Information retrieval; Internet; Service oriented architecture; Wrapping; XML;
Conference_Titel :
Information Technology: Research and Education, 2003. Proceedings. ITRE2003. International Conference on
Print_ISBN :
0-7803-7724-9
DOI :
10.1109/ITRE.2003.1270674