Title :
Database redundant attribute detection using fractal dimension
Author_Institution :
Dept. of Comput. Sci., Jinan Univ., Guangzhou, China
Abstract :
The method for detecting redundant attributes in relational datasets using the fractal ideology is studied. Based on the fractal dimension of a dataset and its variations, an algorithm for detecting redundant attributes is presented. The work has the following features: datasets with numeric and discrete attributes can be processed; an approach based on depth-equal data dimension division(i.e., the number of attribute values in each interval of a divided dimension is the same) is introduced for computing approximate fractal dimension of a dataset; and the existed algorithm for detecting redundant attributes is expanded, which can discover correlated attribute pairs in a dataset. The experimental results prove the validity of the proposed algorithms.
Keywords :
database management systems; fractals; approximate fractal dimension; correlated attribute pairs; database redundant attribute detection; fractal ideology; redundant attributes; relational datasets; Approximation algorithms; Correlation; Data mining; Databases; Feature extraction; Fractals; data quality; fractal dimension; redundant attribute;
Conference_Titel :
Software Engineering and Service Science (ICSESS), 2014 5th IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-3278-8
DOI :
10.1109/ICSESS.2014.6933630