• DocumentCode
    2582777
  • Title

    A wrapper induction application with knowledge base support: a use case for initiation and maintenance of wrappers

  • Author

    Miled, Z.B. ; Farooq, Ali ; Mahoui, Malika ; Li, Nianhua ; Dippold, Mindi ; Bukhres, Omran

  • Author_Institution
    Dept. of Electr. & Comput. Eng., IUPUI, Indianapolis, IN, USA
  • fYear
    2005
  • fDate
    19-21 Oct. 2005
  • Firstpage
    65
  • Lastpage
    72
  • Abstract
    Integrating life science Web databases, while important and necessary, is a challenge for current integration systems mainly due to the large number of these databases, their heterogeneity and the fact that their interfaces may change often. BACIIS, a biological and chemical information integration system, is a tightly coupled federated database system that uses the mediator wrapper method in order to retrieve information from several remote Web databases. BACIIS relies on a semi-automated approach for generating and maintaining wrappers in order to provide a scalable system with a limited maintenance overhead. The semi-automatic wrapper induction in BACIIS is efficient because it is based on, but not limited to a domain knowledge. Tests show that the use of ontology increases the accuracy of the wrapper induction. We also present how the wrapper induction system facilitates wrapper update, and assists in the information extraction. By using a wrapper induction system for creation and maintenance of wrappers, scalability, flexibility, and stability of the integrated information system is easily maintained.
  • Keywords
    Internet; biology computing; information retrieval systems; knowledge based systems; ontologies (artificial intelligence); BACIIS wrapper induction system; biological-chemical information integration system; information extraction; information retrieval; knowledge base support; life science Web databases; mediator wrapper method; ontology; remote Web databases; semi-automatic wrapper induction; tightly coupled federated database system; Chemicals; Data mining; Database systems; Induction generators; Information retrieval; Information systems; Ontologies; Scalability; Stability; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering, 2005. BIBE 2005. Fifth IEEE Symposium on
  • Print_ISBN
    0-7695-2476-1
  • Type

    conf

  • DOI
    10.1109/BIBE.2005.11
  • Filename
    1544450