DocumentCode :
2331400
Title :
An Effective Content-Based Schema Matching Algorithm
Author :
Yang, Yuan ; Chen, Mengdong ; Gao, Bin
Author_Institution :
Dept. of Comput. Sci. & Technol., Beihang Univ., Beijing
fYear :
2008
fDate :
20-20 Nov. 2008
Firstpage :
7
Lastpage :
11
Abstract :
Identifying database corresponding attributes in schema matching plays a key role in data integration in heterogeneous databases. Most of current approaches mainly use schema information of attribute. Little research has attempted to fully explore the use of data content. This paper introduces a novel schema matching algorithm based on data content, which has two-step process. First, through the analysis of the data pattern, we train a set of neural networks which used for calculating candidate matching pairs. Then we apply a rule-based algorithm to filter the candidate pairs and get correct matching result. The experiment result based on real data shows our proposed approach can improve the precision and recall of schema matching obviously. The approach can either be used independently or work together with other schema matching methods.
Keywords :
distributed databases; knowledge based systems; neural nets; content-based schema matching algorithm; data content; data integration; data pattern; heterogeneous databases; neural networks; rule-based algorithm; Content management; Data engineering; Databases; Engineering management; Information management; Information technology; Neural networks; Pattern matching; Seminars; Technology management; data integration; database; schema matching;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Future Information Technology and Management Engineering, 2008. FITME '08. International Seminar on
Conference_Location :
Leicestershire, United Kingdom
Print_ISBN :
978-0-7695-3480-0
Type :
conf
DOI :
10.1109/FITME.2008.38
Filename :
4746429
Link To Document :
بازگشت