• DocumentCode
    253982
  • Title

    Associative Embeddings for Large-Scale Knowledge Transfer with Self-Assessment

  • Author

    Vezhnevets, Alexander ; Ferrari, V.

  • Author_Institution
    Univ. of Edinburgh, Edinburgh, UK
  • fYear
    2014
  • fDate
    23-28 June 2014
  • Firstpage
    1987
  • Lastpage
    1994
  • Abstract
    We propose a method for knowledge transfer between semantically related classes in ImageNet. By transferring knowledge from the images that have bounding-box annotations to the others, our method is capable of automatically populating ImageNet with many more bounding-boxes. The underlying assumption that objects from semantically related classes look alike is formalized in our novel Associative Embedding (AE) representation. AE recovers the latent low-dimensional space of appearance variations among image windows. The dimensions of AE space tend to correspond to aspects of window appearance (e.g. side view, close up, background). We model the overlap of a window with an object using Gaussian Processes (GP) regression, which spreads annotation smoothly through AE space. The probabilistic nature of GP allows our method to perform self-assessment, i.e. assigning a quality estimate to its own output. It enables trading off the amount of returned annotations for their quality. A large scale experiment on 219 classes and 0.5 million images demonstrates that our method outperforms state-of-the-art methods and baselines for object localization. Using self-assessment we can automatically return bounding-box annotations for 51% of all images with high localization accuracy (i.e. 71% average overlap with ground-truth).
  • Keywords
    Gaussian processes; image representation; knowledge acquisition; regression analysis; AE representation; GP regression; Gaussian process; ImageNet; associative embeddings; bounding-box annotations; image self-assessment; large-scale knowledge transfer; window appearance aspect; Estimation; Gaussian processes; Kernel; Manuals; Probability distribution; Training; Visualization; ImageNet; large scale; object localization; representation learning; transfer learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
  • Conference_Location
    Columbus, OH
  • Type

    conf

  • DOI
    10.1109/CVPR.2014.255
  • Filename
    6909652