Title :
Towards effective subspace clustering with an evolutionary algorithm
Author :
Sarafis, Ioannis A. ; Trinder, P.W. ; Zalzala, Ali M S
Author_Institution :
Sch. of Math. & Comput. Sci., Heriot-Watt Univ., Edinburgh, UK
Abstract :
We propose a new evolutionary algorithm for subspace clustering in very large and high-dimensional databases. The design includes task-specific coding and genetic operators, along with a nonrandom initialization procedure. Experimental results show that the algorithm scales almost linearly with the size and dimensionality of the database as well as the dimensionality of the hidden clusters. Our algorithm is able to discover clusters of different densities embedded in both low and high dimensional subspaces of the original space. Finally, the discovered knowledge is presented in the form of nonoverlapping clustering rules where only those features relevant to the clustering are reported. These two properties contributes to the relatively high comprehensibility of the clustering output.
Keywords :
data mining; evolutionary computation; pattern clustering; very large databases; evolutionary algorithm; genetic operators; hidden clusters; knowledge discovery; nonoverlap clustering rules; nonrandom initialization procedure; subspace clustering; task-specific coding; very large databases; Clustering algorithms; Data analysis; Databases; Encoding; Evolutionary computation; Genetic mutations; Physics computing;
Conference_Titel :
Evolutionary Computation, 2003. CEC '03. The 2003 Congress on
Print_ISBN :
0-7803-7804-0
DOI :
10.1109/CEC.2003.1299749