• DocumentCode
    253717
  • Title

    Ask the Image: Supervised Pooling to Preserve Feature Locality

  • Author

    Fanello, S.R. ; Noceti, Nicoletta ; Ciliberto, Carlo ; Metta, G. ; Odone, F.

  • fYear
    2014
  • fDate
    23-28 June 2014
  • Firstpage
    851
  • Lastpage
    858
  • Abstract
    In this paper we propose a weighted supervised pooling method for visual recognition systems. We combine a standard Spatial Pyramid Representation which is commonly adopted to encode spatial information, with an appropriate Feature Space Representation favoring semantic information in an appropriate feature space. For the latter, we propose a weighted pooling strategy exploiting data supervision to weigh each local descriptor coherently with its likelihood to belong to a given object class. The two representations are then combined adaptively with Multiple Kernel Learning. Experiments on common benchmarks (Caltech-256 and PASCAL VOC-2007) show that our image representation improves the current visual recognition pipeline and it is competitive with similar state-of-art pooling methods. We also evaluate our method on a real Human-Robot Interaction setting, where the pure Spatial Pyramid Representation does not provide sufficient discriminative power, obtaining a remarkable improvement.
  • Keywords
    feature extraction; human-robot interaction; image representation; learning (artificial intelligence); object recognition; robot vision; Caltech-256; PASCAL VOC-2007; data supervision; feature locality preservation; feature space representation; human-robot interaction setting; image representation; local descriptor; multiple kernel learning; semantic information; spatial information encoding; standard spatial pyramid representation; visual recognition pipeline; visual recognition systems; weighted supervised pooling method; Dictionaries; Encoding; Feature extraction; Kernel; Pipelines; Standards; Visualization; Human Robot Interaction; Object Categorization; Object Recognition; Supervised Pooling; iCub; iCubWorld Dataset;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
  • Conference_Location
    Columbus, OH
  • Type

    conf

  • DOI
    10.1109/CVPR.2014.114
  • Filename
    6909509