Title :
A Deep Web Database Sampling Method Based on High Correlation Keywords
Author :
Zheng, Yongqing ; Bian, Yufang ; Du, Xin ; Wu, Hongchen
Author_Institution :
Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
Abstract :
Evaluation of the Deep Web data sources must be based on the data in the Web databases, then how to select the most representative keywords as a query word to obtain a large number of uniformly distributed data is a major difficulty, this paper proposed a Deep Web database sampling method based on high correlation keyword, using a graph based keyword-connected network to get query words, the method can get a random sample of high-quality data from the Deep Web data source more efficiently.
Keywords :
distributed databases; information retrieval systems; query processing; deep Web data sources; deep Web database sampling method; graph based keyword connected network; high correlation keywords; query word; Correlation; Data mining; Distributed databases; Educational institutions; Knowledge engineering; Sampling methods; Deep Web Sampling; High Correlation;
Conference_Titel :
Web Information Systems and Applications Conference (WISA), 2012 Ninth
Conference_Location :
Haikou
Print_ISBN :
978-1-4673-3054-1
DOI :
10.1109/WISA.2012.25