DocumentCode :
604424
Title :
Interface schema matching with the machine learning for deep web
Author :
Guanwen Zhu ; Hongbin Wang ; Nianbin Wang ; Qianqian Jiao
Author_Institution :
Coll. of Comput. Sci. & Technol., Harbin Eng. Univ., Harbin, China
fYear :
2012
fDate :
29-31 Dec. 2012
Firstpage :
822
Lastpage :
825
Abstract :
With the rapid development of the World Wide Web, information contained in the deep web is increasing dramatically. Since different query interfaces are heterogeneous and autonomous inherently, even in the same domain, it is a huge challenge to allow users efficiently and quickly to get their own satisfying information. Deep web query interfaces integration can solve this problem well. The interface schema matching is the foremost step in the steps of the deep web query interfaces integration. This paper takes 120 data sources as a training set and 40 data sources as a testing set. Combined with the idea of multi-strategy learning technology, a deep web interface schema matching method based on machine learning is proposed. The method transformed the schema matching problem into the machine learning classification, and achieved the schema matching automatically. In order to enhance the accuracy of the mappings, the concept of domain ontology is introduced in this paper. The experimental results show that the method has an average accuracy rate of 80%-90%.
Keywords :
Internet; learning (artificial intelligence); ontologies (artificial intelligence); pattern classification; query processing; World Wide Web; automatic schema matching; autonomous query interfaces; deep Web query interface integration; domain ontology; heterogeneous query interfaces; interface schema matching; machine learning classification; mapping accuracy enhancement; multistrategy learning technology; deep web; domain ontology; meta-learning; multi-stratry learning; schema matching;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Network Technology (ICCSNT), 2012 2nd International Conference on
Conference_Location :
Changchun
Print_ISBN :
978-1-4673-2963-7
Type :
conf
DOI :
10.1109/ICCSNT.2012.6526056
Filename :
6526056
Link To Document :
بازگشت