Associative Embeddings for Large-Scale Knowledge Transfer with Self-Assessment

Author

Vezhnevets, Alexander ; Ferrari, V.

Author_Institution

Univ. of Edinburgh, Edinburgh, UK

fYear

2014

fDate

23-28 June 2014

Firstpage

1987

Lastpage

1994

Abstract

We propose a method for knowledge transfer between semantically related classes in ImageNet. By transferring knowledge from the images that have bounding-box annotations to the others, our method is capable of automatically populating ImageNet with many more bounding-boxes. The underlying assumption that objects from semantically related classes look alike is formalized in our novel Associative Embedding (AE) representation. AE recovers the latent low-dimensional space of appearance variations among image windows. The dimensions of AE space tend to correspond to aspects of window appearance (e.g. side view, close up, background). We model the overlap of a window with an object using Gaussian Processes (GP) regression, which spreads annotation smoothly through AE space. The probabilistic nature of GP allows our method to perform self-assessment, i.e. assigning a quality estimate to its own output. It enables trading off the amount of returned annotations for their quality. A large scale experiment on 219 classes and 0.5 million images demonstrates that our method outperforms state-of-the-art methods and baselines for object localization. Using self-assessment we can automatically return bounding-box annotations for 51% of all images with high localization accuracy (i.e. 71% average overlap with ground-truth).

Keywords

Gaussian processes; image representation; knowledge acquisition; regression analysis; AE representation; GP regression; Gaussian process; ImageNet; associative embeddings; bounding-box annotations; image self-assessment; large-scale knowledge transfer; window appearance aspect; Estimation; Gaussian processes; Kernel; Manuals; Probability distribution; Training; Visualization; ImageNet; large scale; object localization; representation learning; transfer learning;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on

Conference_Location

Columbus, OH

Type

conf

DOI

10.1109/CVPR.2014.255

Filename

6909652