Title of article :
Nearest neighbor classification of categorical data by attributes weighting
Author/Authors :
Chen، نويسنده , , Lifei and Guo، نويسنده , , Gongde، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2015
Abstract :
Subspace classification of categorical data is an essential process for many real-world applications such as computer-aided medical diagnosis and collaborative recommendation. The nearest neighbor classifiers have sparked wide interest from these applications because of their simplicity and flexibility. However, they become ineffective when applied to categorical data, due to the lack of a well-defined distance measure used to compute dissimilarities between categorical samples in the projected subspaces. In this paper, we tackle the problem by defining a series of weighted distance functions for categorical attributes, and applying them to derive new nearest neighbor classifiers. Four attribute-weighting measures are proposed, with two defined on global feature-ranking approaches while the other two on local approaches. The experimental results conducted on real categorical data sets demonstrate that all four classifiers outperform consistently the traditional methods, and show the suitability of the proposal for the real applications in terms of automated feature selection.
Keywords :
Projected subspace , Categorical data , Nearest neighbor classification , feature selection , Distance measure , attribute weighting
Journal title :
Expert Systems with Applications
Journal title :
Expert Systems with Applications