DocumentCode :
2427373
Title :
A Machine Learning Approach Classification of Deep Web Sources
Author :
Xu, Hexiang ; Zhang, Chenghong ; Hao, Xiulan ; Hu, Yunfa
Author_Institution :
Fudan Univ., Shanghai
Volume :
4
fYear :
2007
fDate :
24-27 Aug. 2007
Firstpage :
561
Lastpage :
565
Abstract :
The classification of deep Web sources is an important area in large-scale deep Web integration, which is still at an early stage. Many deep web sources are structured by providing structured query interfaces and results. Classifying such structured sources into domains is one of the critical steps toward the integration of heterogeneous Web sources. To date, in terms of the classification, existing works mainly focus on classifying texts or Web documents, and there is little in the deep web. In this paper, we present a deep Web model and machine learning based classifying model. The experimental results show that we can achieve a good performance with a small scale training samples for each domain, and as the number of training samples increases, the performance keeps stabilization.
Keywords :
Internet; classification; learning (artificial intelligence); deep Web sources classification; machine learning; Databases; Information technology; Internet; Large scale integration; Machine learning; Oceans; Sea surface; Search engines; Technology management; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2874-8
Type :
conf
DOI :
10.1109/FSKD.2007.54
Filename :
4406450
Link To Document :
بازگشت