DocumentCode :
3730546
Title :
A deep web query interface discovery method
Author :
Bo Liu;Zhenxing Li
Author_Institution :
Department of Computer Science, Jinan University, Guangzhou, China
fYear :
2015
Firstpage :
1340
Lastpage :
1344
Abstract :
For the purpose of obtaining deep web query interface from forms accurately, this paper proposes a framework of automatic deep web discovery, which includes procedures of collecting web pages, extracting forms and features, filtering forms, and identifying forms. A heuristic rule-based k-nearest neighbor algorithm for identifying the query interfaces is introduced. In the experiments, a number of query interfaces and non-query interfaces from different domains are selected for classifying the query interfaces. Experimental results demonstrate that the presented algorithm can significantly improve the accuracy of deep web query interface discovery.
Keywords :
"Feature extraction","Training","Classification algorithms","Databases","Web pages","Filtering","Search engines"
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2015 12th International Conference on
Type :
conf
DOI :
10.1109/FSKD.2015.7382138
Filename :
7382138
Link To Document :
بازگشت