Video-Based Object Recognition Using Novel Set-of-Sets Representations

Author

Yang Liu ; Youngkyoon Jang ; Woontack Woo ; Tae-Kyun Kim

Author_Institution

Imperial Coll. London, London, UK

fYear

2014

fDate

23-28 June 2014

Firstpage

533

Lastpage

540

Abstract

We address the problem of object recognition in egocentric videos, where a user arbitrarily moves a mobile camera around an unknown object. Using a video that captures variation in an object´s appearance owing to camera motion (more viewpoints, scales, clutter and lighting conditions), can accumulate evidence and improve object recognition accuracy. Most previous work has taken a single image as input, or tackled a video simply by a collection i.e. sum of frame-based recognition scores. In this paper, beyond frame-based recognition, we propose two novel set-of-sets representations of a video sequence for object recognition. We combine the techniques of bag of words for a set of data spatially distributed thus heterogeneous, and manifold for a set of data temporally smooth and homogeneous, to construct the two proposed set-of-sets representations. We also propose methods to perform matching for the two representations respectively. The representations and matching techniques are evaluated on our video-based object recognition datasets, which contain 830 videos of ten objects and four environmental variations. The experiments on the challenging new datasets show that our proposed solution significantly outperforms the traditional frame-based methods.

Keywords

cameras; image matching; image motion analysis; image representation; image sequences; object recognition; video signal processing; bag of words techniques; egocentric videos; environmental variations; heterogeneous data; matching techniques; mobile camera motion; object appearance; offrame-based recognition scores; set-of-set representation; spatially distributed data; video sequence; video-based object recognition accuracy; video-based object recognition datasets; Image recognition; Kernel; Manifolds; Object recognition; Trajectory; Vectors; Videos; object recognition; set-of-sets representations; video analysis;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Vision and Pattern Recognition Workshops (CVPRW), 2014 IEEE Conference on

Conference_Location

Columbus, OH

Type

conf

DOI

10.1109/CVPRW.2014.83

Filename

6910032