Title :
Similarity Learning for Nearest Neighbor Classification
Author :
Qamar, Ali Mustafa ; Gaussier, Eric ; Chevallet, Jean-Pierre ; Lim, Joo Hwee
Author_Institution :
Lab. d´´Inf. de Grenoble, Univ. Joseph Fourier, Grenoble
Abstract :
In this paper, we propose an algorithm for learning a general class of similarity measures for kNN classification. This class encompasses, among others, the standard cosine measure, as well as the Dice and Jaccard coefficients. The algorithm we propose is an extension of the voted perceptron algorithm and allows one to learn different types of similarity functions (either based on diagonal, symmetric or asymmetric similarity matrices). The results we obtained show that learning similarity measures yields significant improvements on several collections, for two prediction rules: the standard kNN rule, which was our primary goal, and a symmetric version of it.
Keywords :
learning (artificial intelligence); matrix algebra; pattern classification; Dice coefficient; Jaccard coefficient; asymmetric matrix; diagonal matrix; kNN classification; nearest neighbor classification; similarity learning; standard cosine measure; symmetric matrix; voted perceptron algorithm; Data mining; Databases; Equations; Euclidean distance; Gaussian processes; Machine learning; Measurement standards; Nearest neighbor searches; Pattern recognition; Symmetric matrices; Data Mining; Machine Learning; Nearest Neighbor Classification; Similarity Learning;
Conference_Titel :
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location :
Pisa
Print_ISBN :
978-0-7695-3502-9
DOI :
10.1109/ICDM.2008.81