DocumentCode :
2578924
Title :
A Deep Web Database Sampling Method Based on High Correlation Keywords
Author :
Zheng, Yongqing ; Bian, Yufang ; Du, Xin ; Wu, Hongchen
Author_Institution :
Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
fYear :
2012
fDate :
16-18 Nov. 2012
Firstpage :
9
Lastpage :
14
Abstract :
Evaluation of the Deep Web data sources must be based on the data in the Web databases, then how to select the most representative keywords as a query word to obtain a large number of uniformly distributed data is a major difficulty, this paper proposed a Deep Web database sampling method based on high correlation keyword, using a graph based keyword-connected network to get query words, the method can get a random sample of high-quality data from the Deep Web data source more efficiently.
Keywords :
distributed databases; information retrieval systems; query processing; deep Web data sources; deep Web database sampling method; graph based keyword connected network; high correlation keywords; query word; Correlation; Data mining; Distributed databases; Educational institutions; Knowledge engineering; Sampling methods; Deep Web Sampling; High Correlation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Information Systems and Applications Conference (WISA), 2012 Ninth
Conference_Location :
Haikou
Print_ISBN :
978-1-4673-3054-1
Type :
conf
DOI :
10.1109/WISA.2012.25
Filename :
6385174
Link To Document :
بازگشت