DocumentCode :
3672582
Title :
Beyond spatial pooling: Fine-grained representation learning in multiple domains
Author :
Chi Li;Austin Reiter;Gregory D. Hager
Author_Institution :
Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, United States
fYear :
2015
fDate :
6/1/2015 12:00:00 AM
Firstpage :
4913
Lastpage :
4922
Abstract :
Object recognition systems have shown great progress over recent years. However, creating object representations that are robust to changes in viewpoint while capturing local visual details continues to be a challenge. In particular, recent convolutional architectures employ spatial pooling to achieve scale and shift invariances, but they are still sensitive to out-of-plane rotations. In this paper, we formulate a probabilistic framework for analyzing the performance of pooling. This framework suggests two directions for improvement. First, we apply multiple scales of filters coupled with different pooling granularities, and second we make use of color as an additional pooling domain, thereby reducing the sensitivity to spatial deformations. We evaluate our algorithm on the object instance recognition task using two independent publicly available RGB-D datasets, and demonstrate significant improvements over the current state-of-the-art. In addition, we present a new dataset for industrial objects to further validate the effectiveness of our approach versus other state-of-the-art approaches for object recognition using RGB-D data.
Keywords :
"Image color analysis","Yttrium","Three-dimensional displays","Visualization","Probabilistic logic","Dictionaries","Object recognition"
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on
Electronic_ISBN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2015.7299125
Filename :
7299125
Link To Document :
بازگشت