Title :
Hypothesis Testing Based Knowledge Discovery in Distributed Multiple Data Sources
Author :
Bai, Shilei ; Ren, Hui ; Jiang, Wei ; Jiang, Yujiang
Author_Institution :
Sch. of Inf. Eng., Commun. Univ. of China, Beijing, China
Abstract :
In the past several years, most data mining researchers focus on data mining from single data source. Nowadays, data mining from multiple data sources is a new problem in Web environment and is also an efficient technique for solving knowledge discovery in distributed databases. A new method for mining multi-data sources is presented in this paper. By sharing knowledge patterns discovered in other similar data sources, hypothesis testing is employed for verifying whether the patterns are also suitable for local data source or not. So that can improve the efficiency of KDD greatly. Finally the effectiveness of this method is analyzed and experimental result is given. This method can be extended as an efficient data mining algorithm in case of apriori hypothesizes are provided. And it can be also used for incremental data mining.
Keywords :
data mining; Web environment; data mining researchers; distributed databases; distributed multiple data sources; hypothesis testing based knowledge discovery; Association rules; Chromium; Distributed databases; Inspection; Testing;
Conference_Titel :
Internet Technology and Applications, 2010 International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-5142-5
Electronic_ISBN :
978-1-4244-5143-2
DOI :
10.1109/ITAPP.2010.5566134