DocumentCode :
253851
Title :
Deep Fisher Kernels -- End to End Learning of the Fisher Kernel GMM Parameters
Author :
Sydorov, Vladyslav ; Sakurada, Mayu ; Lampert, Christoph H.
Author_Institution :
IST Austria, Klosterneuburg, Austria
fYear :
2014
fDate :
23-28 June 2014
Firstpage :
1402
Lastpage :
1409
Abstract :
Fisher kernels and deep learning are two developments that have had a significant impact on large-scale object categorization in recent years. Both approaches have been shown to achieve state-of-the-art results on large-scale object categorization datasets such as ImageNet. Conceptually, however, they are perceived as very different, and it is not uncommon for heated debates to spring up when advocates of the two paradigms meet at conferences or workshops. In this work, we emphasize the similarities between the two architectures rather than their differences, and we argue that such a unified view allows us to transfer ideas from one domain to the other. As a concrete example, we introduce a method for learning a support vector machine classifier with a Fisher kernel at the same time as a task-specific data representation. We reinterpret the setting as a multi-layer feed-forward network. Its final layer is the classifier, parameterized by a weight vector, and the two previous layers compute Fisher vectors, parameterized by the coefficients of a Gaussian mixture model. We introduce a gradient-descent-based learning algorithm that, in contrast to other feature learning techniques, is not just derived from intuition or biological analogy but has a theoretical justification in the framework of statistical learning theory. Our experiments show that the new training procedure leads to significant improvements in classification accuracy while preserving the modularity and geometric interpretability of a support vector machine setup.
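The layered view described in the abstract can be illustrated with a minimal forward-pass sketch. It is not taken from the paper: it assumes a diagonal-covariance GMM and the commonly used Fisher vector gradient statistics with respect to the component means and standard deviations, and it omits power and L2 normalization. The function names (`gmm_posteriors`, `fisher_vector`, `classifier_score`) are hypothetical, and only the Python standard library is used to keep the example self-contained.

```python
import math

def gmm_posteriors(x, weights, means, sigmas):
    """Layer 1 (assumed form): soft-assign one descriptor x to K diagonal Gaussians."""
    log_p = []
    for w, mu, sd in zip(weights, means, sigmas):
        lp = math.log(w)
        for xd, md, sdd in zip(x, mu, sd):
            z = (xd - md) / sdd
            lp += -0.5 * z * z - math.log(sdd) - 0.5 * math.log(2.0 * math.pi)
        log_p.append(lp)
    top = max(log_p)  # subtract the max before exponentiating, for stability
    p = [math.exp(lp - top) for lp in log_p]
    total = sum(p)
    return [pk / total for pk in p]

def fisher_vector(X, weights, means, sigmas):
    """Layer 2 (assumed form): average gradient statistics over all descriptors."""
    n, K, D = len(X), len(weights), len(X[0])
    g_mu = [[0.0] * D for _ in range(K)]  # statistics w.r.t. the means
    g_sd = [[0.0] * D for _ in range(K)]  # statistics w.r.t. the std deviations
    for x in X:
        gamma = gmm_posteriors(x, weights, means, sigmas)
        for k in range(K):
            for d in range(D):
                z = (x[d] - means[k][d]) / sigmas[k][d]
                g_mu[k][d] += gamma[k] * z
                g_sd[k][d] += gamma[k] * (z * z - 1.0)
    fv = []
    for k in range(K):
        fv += [g / (n * math.sqrt(weights[k])) for g in g_mu[k]]
    for k in range(K):
        fv += [g / (n * math.sqrt(2.0 * weights[k])) for g in g_sd[k]]
    return fv  # length 2 * K * D

def classifier_score(X, weights, means, sigmas, w, b=0.0):
    """Final layer: linear decision function applied to the Fisher vector."""
    fv = fisher_vector(X, weights, means, sigmas)
    return sum(wi * fi for wi, fi in zip(w, fv)) + b
```

In this sketch every step is differentiable in the GMM parameters (`weights`, `means`, `sigmas`), which is what would make joint gradient-based training of the representation and the classifier weight vector `w` possible in principle.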
Keywords :
Gaussian processes; feedforward neural nets; gradient methods; image classification; learning (artificial intelligence); support vector machines; Fisher kernel GMM parameters; Fisher vectors; Gaussian mixture model; classification accuracy improvements; deep Fisher kernels; deep learning; end to end learning; geometric interpretability; gradient descent-based learning algorithm; large-scale object categorization datasets; modularity preservation; multilayer feedforward network; statistical learning theory; support vector machine classifier; task-specific data representation; weight vector; Accuracy; Computer architecture; Computer vision; Kernel; Support vector machines; Training; Vectors; Fisher kernel; Gaussian mixture models; deep learning; image classification; support vector machines;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
Conference_Location :
Columbus, OH
Type :
conf
DOI :
10.1109/CVPR.2014.182
Filename :
6909578