DocumentCode
2597950
Title
An efficient wrapper generation in DIMS
Author
Fan, Linghua
Author_Institution
CNRS, Univ. de Valenciennes et du Hainaut Cambresis, France
fYear
2003
fDate
11-13 Aug. 2003
Firstpage
525
Lastpage
529
Abstract
We discuss the problem of distributed, autonomous and heterogeneous Web data sources wrapper, describe an efficient innovate wrapper generation for access, retrieve these information. Our approach is an XML-based methodology whose goal extends for beyond simple "screen scrapping". Our system architecture is based on famous mediator-wrapper architecture. We use some efficient tools, such as Fatdog\´s XQEngine, Jarkarta Lucene, Jtidy, JDBC or JDBC-ODBC bridge drivers, to facile wrapper heterogeneous Web sources into standard XML data, and translate the user\´s query to related wrapper using mediator and integrate these results. For demonstration, we develop distributed information management system (DIMS) framework that is an XML-based, mediator-wrapper architecture information integration system for accessing these Web sources. An approach for DIMS wrapper generation is mainly discussed.
Keywords
Internet; XML; information retrieval; Fatdog XQEngine tool; JDBC-ODBC bridge drivers tool; Jarkarta Lucene tool; Jtidy tool; Web data sources wrapper; XML; XML-based mediator-wrapper architecture; distributed information management system; information extraction; information integration system; screen scrapping; Bridges; Data mining; Data models; HTML; Information management; Information retrieval; Internet; Service oriented architecture; Wrapping; XML;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Technology: Research and Education, 2003. Proceedings. ITRE2003. International Conference on
Print_ISBN
0-7803-7724-9
Type
conf
DOI
10.1109/ITRE.2003.1270674
Filename
1270674
Link To Document