DocumentCode :
2779354
Title :
Symbiotic evolutionary subspace clustering
Author :
Vahdat, Ali ; Heywood, Malcolm I. ; Zincir-Heywood, A. Nur
Author_Institution :
Fac. of Comput. Sci., Dalhousie Univ., Halifax, NS, Canada
fYear :
2012
fDate :
10-15 June 2012
Firstpage :
1
Lastpage :
8
Abstract :
New emerging high-dimensional data sets have made traditional clustering algorithms increasingly inefficient. More sophisticated approaches are required to cope with the increasing dimensionality and cardinality of such data sets. Feature selection methods are proposed as a solution to deal with this problem, however they fail for data sets where the attribute support for different clusters is not the same. For this category of data sets subspace clustering algorithms have been introduced over the past decade. We approach this problem from the perspective of Genetic Algorithms by adopting a hierarchical data structure deployed in three stages. 1) a traditional clustering algorithm is applied independently to each attribute of the data set, thus defining a grid of potential 1-d cluster centroids. 2) representing multi-dimensional cluster centroids by indexing 1-d cluster centroids. 3) converting the problem of finding the best combination of cluster centroids into that of discrete optimization and applying a multi-objective evolutionary algorithm, which uses group fitness evaluation to give a fitness to a group of clusters, as defined by process 2. Synthetic data sets with different characteristics are generated as the ground truth to evaluate the resulting algorithm for Evolutionary Subspace Clustering (ESC) as well as benchmark against alternative subspace and full-space clustering algorithms. ESC returns competitive accuracy and while typically utilizing less attributes and scaling as attribute count increases.
Keywords :
data structures; genetic algorithms; pattern clustering; 1-d cluster centroids; ESC; attribute support; discrete optimization; genetic algorithms; group fitness evaluation; hierarchical data structure; high-dimensional data sets; multiobjective evolutionary algorithm; symbiotic evolutionary subspace clustering; Accuracy; Benchmark testing; Clustering algorithms; Encoding; Indexes; Optimization; Runtime;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Evolutionary Computation (CEC), 2012 IEEE Congress on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-1510-4
Electronic_ISBN :
978-1-4673-1508-1
Type :
conf
DOI :
10.1109/CEC.2012.6252895
Filename :
6252895
Link To Document :
بازگشت