Title :
A Holistic Approach on Deep Web Schema Matching
Author :
Zhong, Xin ; Fu, Yuchen ; Liu, Quan ; Lin, Xinghong ; Cui, Zhiming
Author_Institution :
Soochow Univ., Taipei
Abstract :
Schema matching is a critical problem in Deep Web integration process. This paper introduces a holistic approach, to match many schemas at the same time and find all matchings at once. We mainly analyses and compares the two existent archetypal systems: MGS and DCM. Furthermore, propose a new algorithm, named Correlated-clustering, based on advantages of the two existent systems. This algorithm first mines group attributes by positively correlated attributes, and then clusters the concepts by calculating the similarity of each two concepts, finally, develop a strategy to select matching. The experiment result shows the effectiveness and completeness of our algorithm, which demonstrate the promise of holistic schema matching.
Keywords :
Internet; data mining; pattern clustering; DCM archetypal system; Deep Web schema matching; MGS archetypal system; correlated-clustering; group attributes mining; Clustering algorithms; Databases; Floods; Graphical user interfaces; HTML; Information technology; Internet; Large-scale systems; Uniform resource locators; Web server;
Conference_Titel :
Convergence Information Technology, 2007. International Conference on
Conference_Location :
Gyeongju
Print_ISBN :
0-7695-3038-9
DOI :
10.1109/ICCIT.2007.29