Title :
FD_Mine: discovering functional dependencies in a database using equivalences
Author :
Yao, Hong ; Hamilton, Howard J. ; Butz, Cory J.
Author_Institution :
Dept. of Comput. Sci., Regina Univ., Sask., Canada
Abstract :
The discovery of FDs from databases has recently become a significant research problem. In this paper, we propose a new algorithm, called FD-Mine. FD-Mine takes advantage of the rich theory of FDs to reduce both the size of the dataset and the number of FDs to be checked by using discovered equivalences. We show that the pruning does not lead to loss of information. Experiments on 15 UCI datasets show that FD-Mine can prune more candidates than previous methods.
Keywords :
data mining; relational databases; FD_Mine algorithm; UCI datasets; databases; discovered equivalences; functional dependence discovery; pruning; Chemical compounds; Computer science; Independent component analysis; Lattices; Partitioning algorithms; Relational databases; Sorting;
Conference_Titel :
Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7695-1754-4
DOI :
10.1109/ICDM.2002.1184040