DocumentCode :
548148
Title :
An approach to learn categorical distance based on attributes correlatio
Author :
Khorshidpour, Z. ; Hashemi, SayedMasoud ; Hamzeh, Ali
fYear :
2011
fDate :
17-19 May 2011
Firstpage :
1
Lastpage :
1
Abstract :
Summary from only given. Measuring similarity or distance plays a key rolefor data mining and knowledge discovery tasks. A lot of work has been performed on continuous attributes, but for nominal attributes the similarity computation is not relatively well-understood. In this paper, we propose a novel approach to learn a family of dissimilarity measures for categorical data. Based on these measures distance between two different values of an attribute can be determined by using the certain number of attributes rather than all attributes at once. We evaluate our methods in unsupervised environment, Experiments with real data show that our dissimilarity estimation method improves the accuracy of K-Modes clustering algorithm.
Keywords :
Categorical dataConditional probability distribution; Distance function learning; Kullback Leibler divergence;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical Engineering (ICEE), 2011 19th Iranian Conference on
Conference_Location :
Tehran
Print_ISBN :
978-1-4577-0730-8
Type :
conf
Filename :
5956039
Link To Document :
بازگشت