• DocumentCode
    2331400
  • Title

    An Effective Content-Based Schema Matching Algorithm

  • Author

    Yang, Yuan ; Chen, Mengdong ; Gao, Bin

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Beihang Univ., Beijing
  • fYear
    2008
  • fDate
    20-20 Nov. 2008
  • Firstpage
    7
  • Lastpage
    11
  • Abstract
    Identifying database corresponding attributes in schema matching plays a key role in data integration in heterogeneous databases. Most of current approaches mainly use schema information of attribute. Little research has attempted to fully explore the use of data content. This paper introduces a novel schema matching algorithm based on data content, which has two-step process. First, through the analysis of the data pattern, we train a set of neural networks which used for calculating candidate matching pairs. Then we apply a rule-based algorithm to filter the candidate pairs and get correct matching result. The experiment result based on real data shows our proposed approach can improve the precision and recall of schema matching obviously. The approach can either be used independently or work together with other schema matching methods.
  • Keywords
    distributed databases; knowledge based systems; neural nets; content-based schema matching algorithm; data content; data integration; data pattern; heterogeneous databases; neural networks; rule-based algorithm; Content management; Data engineering; Databases; Engineering management; Information management; Information technology; Neural networks; Pattern matching; Seminars; Technology management; data integration; database; schema matching;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Future Information Technology and Management Engineering, 2008. FITME '08. International Seminar on
  • Conference_Location
    Leicestershire, United Kingdom
  • Print_ISBN
    978-0-7695-3480-0
  • Type

    conf

  • DOI
    10.1109/FITME.2008.38
  • Filename
    4746429