Title :
Variable Density Based Genetic Clustering
Author :
Sabau, Andrei Sorin
Author_Institution :
Fac. of Math. & Comput. Sci., Univ. of Pitesti, Pitesti, Romania
Abstract :
From the existing clustering techniques, spatial density-based ones register one of the most promising results in detecting arbitrary shaped data, being robust to outliers and not restricted by various data distributions. The existing literature contains a plethora of density based algorithms but in all cases one or multiple global parameters need to be set, parameters that are seldom easy to set requiring in depth knowledge about the analyzed data. This paper proposes a parameter-free novel genetic clustering algorithm with an original method for encoding clustering solutions relying on density based clustering parameters. Within each clustering solution genotype, gene position, defined by several density based clustering attributes, plays a key role for recovering the encoded partition. Each gene defined density-based cluster can only attract object not already attracted by previously defined clusters. The proposed encoding scheme allows for always valid crossover results, with great offspring variations even when using simple crossover operators. While not requiring any input parameters, experiments involving multiple clustering validation indices as fitness criteria, across both synthetic and real data sets, show comparable results with existing density-based clustering techniques.
Keywords :
data compression; genetic algorithms; genetics; medical computing; pattern clustering; arbitrary shaped data detection; clustering solution encoding; clustering solution genotype; clustering validation indices; crossover operators; data analysis; data distributions; density-based clustering attributes; fitness criteria; gene position; global parameter-free genetic clustering algorithm; offspring variations; real data sets; spatial density-based ones register method; synthetic sets; variable density-based genetic clustering parameters; Clustering algorithms; Encoding; Genetics; Indexes; Partitioning algorithms; Prototypes; Sociology; density based clustering; encoding algorithm; genetic algorithm; genetic clustering; parameter free clustering;
Conference_Titel :
Symbolic and Numeric Algorithms for Scientific Computing (SYNASC), 2012 14th International Symposium on
Conference_Location :
Timisoara
Print_ISBN :
978-1-4673-5026-6
DOI :
10.1109/SYNASC.2012.31