Title :
A Framework for Mining Functional Dependencies from Large Distributed Databases
Author :
Ye, Feiyue ; Liu, Jixue ; Qian, Jin ; Xue, Xiaofeng
Author_Institution :
Coll. of Comput. Eng., Jiangsu Teachers Univ. of Technol., Changzhou, China
Abstract :
Discovering functional dependencies FDs from existing databases is important to knowledge discovery, machine learning and data quality assessment. A number of algorithms has been proposed in the literature for FD discovery. However these algorithms are designed to work with centralized databases. When they are applied to distributed databases, communication cost of transporting data from different sites makes the algorithms not efficient. In this paper, We analyze the characteristics of mining functional dependencies from large distributed database, and we propose an distributed mining framework for discovery FDs from distribute large databases. We develop a theorem that can prune candidate FDs effectively and extend the partition based approach for distributed databases.
Keywords :
data mining; distributed databases; very large databases; centralized database; data quality assessment; functional dependency mining; knowledge discovery; large distributed database; machine learning; partition based approach; Algorithm design and analysis; Computers; Data mining; Distributed databases; Partitioning algorithms; Relational databases; Data mining; Functional dependencies; Knowledge discovery; Mining functional dependencies;
Conference_Titel :
Artificial Intelligence and Computational Intelligence (AICI), 2010 International Conference on
Conference_Location :
Sanya
Print_ISBN :
978-1-4244-8432-4
DOI :
10.1109/AICI.2010.262