DocumentCode
3413719
Title
A Framework for Mining Functional Dependencies from Large Distributed Databases
Author
Ye, Feiyue ; Liu, Jixue ; Qian, Jin ; Xue, Xiaofeng
Author_Institution
Coll. of Comput. Eng., Jiangsu Teachers Univ. of Technol., Changzhou, China
Volume
3
fYear
2010
fDate
23-24 Oct. 2010
Firstpage
109
Lastpage
113
Abstract
Discovering functional dependencies FDs from existing databases is important to knowledge discovery, machine learning and data quality assessment. A number of algorithms has been proposed in the literature for FD discovery. However these algorithms are designed to work with centralized databases. When they are applied to distributed databases, communication cost of transporting data from different sites makes the algorithms not efficient. In this paper, We analyze the characteristics of mining functional dependencies from large distributed database, and we propose an distributed mining framework for discovery FDs from distribute large databases. We develop a theorem that can prune candidate FDs effectively and extend the partition based approach for distributed databases.
Keywords
data mining; distributed databases; very large databases; centralized database; data quality assessment; functional dependency mining; knowledge discovery; large distributed database; machine learning; partition based approach; Algorithm design and analysis; Computers; Data mining; Distributed databases; Partitioning algorithms; Relational databases; Data mining; Functional dependencies; Knowledge discovery; Mining functional dependencies;
fLanguage
English
Publisher
ieee
Conference_Titel
Artificial Intelligence and Computational Intelligence (AICI), 2010 International Conference on
Conference_Location
Sanya
Print_ISBN
978-1-4244-8432-4
Type
conf
DOI
10.1109/AICI.2010.262
Filename
5656428
Link To Document