DocumentCode :
3447660
Title :
Automatic structured Web databases classification
Author :
Cui, XiaoJun ; Ren, ZhongSheng ; Xiao, HongYu ; Le Xu
Author_Institution :
Wenzhou Vocational Coll. of Sci. & Technol., Wenzhou, China
Volume :
3
fYear :
2010
fDate :
29-31 Oct. 2010
Firstpage :
305
Lastpage :
309
Abstract :
The growing structured Web databases on the web, making large-scale Deep Web data integration faces enormous challenges. Organizing such structured web databases into a hierarchy directory tree is one of critical step towards the large-scale integration of Deep Web. In this paper, a method for automatic classification of Web database is addressed. Firstly, the method for calculating the semantic similarities among the Web databases based on their interface schemas is proposed and be translated to the problem of extended optimal matching for bipartite graph. Then based on the achieved similarity matrix, an agglomerative hierarchical clustering algorithm is proposed, which can organize the Web databases into a hierarchy tree automatically. Theoretical analysis and experimental results show that the method is efficient.
Keywords :
Web sites; database management systems; pattern classification; trees (mathematics); Web data integration; agglomerative hierarchical clustering algorithm; automatic structured Web databases classification; bipartite graph; extended optimal matching; hierarchy directory tree; interface scheme; similarity matrix; Artificial neural networks; Databases; Motion pictures; Nickel; bipartite graph matching; hierarchical clustering; interface schema; web databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Computing and Intelligent Systems (ICIS), 2010 IEEE International Conference on
Conference_Location :
Xiamen
Print_ISBN :
978-1-4244-6582-8
Type :
conf
DOI :
10.1109/ICICISYS.2010.5658701
Filename :
5658701
Link To Document :
بازگشت