DocumentCode :
3107573
Title :
Clustering data set with categorical feature using multi objective genetic algorithm
Author :
Dutta, Dipankar ; Dutta, Paramartha ; Sil, Jaya
Author_Institution :
Dept. of Comput. Sci. & Inf. Technol., Univ. Inst. of Technol., Golapbug, India
fYear :
2012
fDate :
18-20 July 2012
Firstpage :
103
Lastpage :
108
Abstract :
In the paper, real coded multi objective genetic algorithm based K-clustering method has been studied where K represents the number of clusters known apriori. The searching power of Genetic Algorithm (GA) is exploited to search for suitable clusters and cluster modes so that intra-cluster distance (Homogeneity, H) and inter-cluster distances (Separation, S) are simultaneously optimized. It is achieved by measuring H and S using Mod distance per feature metric, suitable for categorical features (attributes). We have selected 3 benchmark data sets from UCI Machine Learning Repository containing categorical features only. Here, K-modes is hybridized with GA to combine global searching capabilities of GA with local searching capabilities of K-modes. Considering context sensitivity, we have used a special crossover operator called “pairwise crossover” and “substitution”.
Keywords :
data mining; genetic algorithms; learning (artificial intelligence); pattern clustering; GA; K-clustering method; K-modes; Mod distance; UCI machine learning repository; categorical feature; categorical features; cluster modes; context sensitivity; crossover operator; data set clustering; global searching capabilities; intercluster distances; intracluster distance; local searching capabilities; pairwise crossover; real coded multiobjective genetic algorithm; substitution; Biological cells; Clustering algorithms; Genetic algorithms; Indexes; Optimization; Sociology; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Science & Engineering (ICDSE), 2012 International Conference on
Conference_Location :
Cochin, Kerala
Print_ISBN :
978-1-4673-2148-8
Type :
conf
DOI :
10.1109/ICDSE.2012.6281897
Filename :
6281897
Link To Document :
بازگشت