DocumentCode
3447660
Title
Automatic structured Web databases classification
Author
Cui, XiaoJun ; Ren, ZhongSheng ; Xiao, HongYu ; Le Xu
Author_Institution
Wenzhou Vocational Coll. of Sci. & Technol., Wenzhou, China
Volume
3
fYear
2010
fDate
29-31 Oct. 2010
Firstpage
305
Lastpage
309
Abstract
The growing structured Web databases on the web, making large-scale Deep Web data integration faces enormous challenges. Organizing such structured web databases into a hierarchy directory tree is one of critical step towards the large-scale integration of Deep Web. In this paper, a method for automatic classification of Web database is addressed. Firstly, the method for calculating the semantic similarities among the Web databases based on their interface schemas is proposed and be translated to the problem of extended optimal matching for bipartite graph. Then based on the achieved similarity matrix, an agglomerative hierarchical clustering algorithm is proposed, which can organize the Web databases into a hierarchy tree automatically. Theoretical analysis and experimental results show that the method is efficient.
Keywords
Web sites; database management systems; pattern classification; trees (mathematics); Web data integration; agglomerative hierarchical clustering algorithm; automatic structured Web databases classification; bipartite graph; extended optimal matching; hierarchy directory tree; interface scheme; similarity matrix; Artificial neural networks; Databases; Motion pictures; Nickel; bipartite graph matching; hierarchical clustering; interface schema; web databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Computing and Intelligent Systems (ICIS), 2010 IEEE International Conference on
Conference_Location
Xiamen
Print_ISBN
978-1-4244-6582-8
Type
conf
DOI
10.1109/ICICISYS.2010.5658701
Filename
5658701
Link To Document