DocumentCode :
254081
Title :
Generalized Max Pooling
Author :
Murray, Naila ; Perronnin, Florent
fYear :
2014
fDate :
23-28 June 2014
Firstpage :
2473
Lastpage :
2480
Abstract :
State-of-the-art patch-based image representations involve a pooling operation that aggregates statistics computed from local descriptors. Standard pooling operations include sum- and max-pooling. Sum-pooling lacks discriminability because the resulting representation is strongly influenced by frequent yet often uninformative descriptors, but only weakly influenced by rare yet potentially highly-informative ones. Max-pooling equalizes the influence of frequent and rare descriptors but is only applicable to representations that rely on count statistics, such as the bag-of-visual-words (BOV)and its soft- and sparse-coding extensions. We propose a novel pooling mechanism that achieves the same effect as max-pooling but is applicable beyond the BOV and especially to the state-of-the-art Fisher Vector -- hence the name Generalized Max Pooling (GMP). It involves equalizing the similarity between each patch and the pooled representation, which is shown to be equivalent to re-weighting the per-patch statistics. We show on five public image classification benchmarks that the proposed GMP can lead to significant performance gains with respect to heuristic alternatives.
Keywords :
image classification; image coding; image representation; vectors; BOV; Fisher vector; bag-of-visual-words; generalized max pooling; local descriptors; max-pooling; patch-based image representations; per-patch statistics; pooled representation; public image classification benchmarks; soft-coding extension; sparse-coding extension; standard pooling operations; sum-pooling; Birds; Computer vision; Encoding; Kernel; Standards; Vectors; Visualization; classification; image representations; pooling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
Conference_Location :
Columbus, OH
Type :
conf
DOI :
10.1109/CVPR.2014.317
Filename :
6909713
Link To Document :
بازگشت