DocumentCode :
679526
Title :
Kernel Density Metric Learning
Author :
Yujie He ; Wenlin Chen ; Yixin Chen ; Yi Mao
Author_Institution :
Dept. of Comput. Sci. & Eng., Washington Univ., St. Louis, MO, USA
fYear :
2013
fDate :
7-10 Dec. 2013
Firstpage :
271
Lastpage :
280
Abstract :
This paper introduces a supervised metric learning algorithm, called kernel density metric learning (KDML), which is easy to use and provides nonlinear, probability-based distance measures. KDML constructs a direct nonlinear mapping from the original input space into a feature space based on kernel density estimation. The nonlinear mapping in KDML embodies established distance measures between probability density functions, and leads to correct classification on datasets for which linear metric learning methods would fail. It addresses the severe challenge to kNN when features are from heterogeneous domains and, as a result, the Euclidean or Mahalanobis distance between original feature vectors is not meaningful. Existing metric learning algorithms can then be applied to the KDML features. We also propose an integrated optimization algorithm that learns not only the Mahalanobis matrix but also kernel bandwidths, the only hyper-parameters in the nonlinear mapping. KDML can naturally handle not only numerical features, but also categorical ones, which is rarely found in previous metric learning algorithms. Extensive experimental results on various datasets show that KDML significantly improves existing metric learning algorithms in terms of kNN classification accuracy.
Keywords :
learning (artificial intelligence); matrix algebra; optimisation; pattern classification; probability; Euclidean distance; KDML; Mahalanobis distance; Mahalanobis matrix; dataset classification; direct nonlinear mapping; feature space; feature vectors; heterogeneous domain; input space; integrated optimization algorithm; kNN classification accuracy; kernel bandwidth; kernel density estimation; kernel density metric learning; probability density functions; probability-based distance measures; supervised metric learning algorithm; Density measurement; Euclidean distance; Kernel; Learning systems; Optimization; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2013 IEEE 13th International Conference on
Conference_Location :
Dallas, TX
ISSN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2013.153
Filename :
6729511
Link To Document :
بازگشت