• DocumentCode
    2597950
  • Title

    An efficient wrapper generation in DIMS

  • Author

    Fan, Linghua

  • Author_Institution
    CNRS, Univ. de Valenciennes et du Hainaut Cambresis, France
  • fYear
    2003
  • fDate
    11-13 Aug. 2003
  • Firstpage
    525
  • Lastpage
    529
  • Abstract
    We discuss the problem of distributed, autonomous and heterogeneous Web data sources wrapper, describe an efficient innovate wrapper generation for access, retrieve these information. Our approach is an XML-based methodology whose goal extends for beyond simple "screen scrapping". Our system architecture is based on famous mediator-wrapper architecture. We use some efficient tools, such as Fatdog\´s XQEngine, Jarkarta Lucene, Jtidy, JDBC or JDBC-ODBC bridge drivers, to facile wrapper heterogeneous Web sources into standard XML data, and translate the user\´s query to related wrapper using mediator and integrate these results. For demonstration, we develop distributed information management system (DIMS) framework that is an XML-based, mediator-wrapper architecture information integration system for accessing these Web sources. An approach for DIMS wrapper generation is mainly discussed.
  • Keywords
    Internet; XML; information retrieval; Fatdog XQEngine tool; JDBC-ODBC bridge drivers tool; Jarkarta Lucene tool; Jtidy tool; Web data sources wrapper; XML; XML-based mediator-wrapper architecture; distributed information management system; information extraction; information integration system; screen scrapping; Bridges; Data mining; Data models; HTML; Information management; Information retrieval; Internet; Service oriented architecture; Wrapping; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology: Research and Education, 2003. Proceedings. ITRE2003. International Conference on
  • Print_ISBN
    0-7803-7724-9
  • Type

    conf

  • DOI
    10.1109/ITRE.2003.1270674
  • Filename
    1270674