• DocumentCode
    254701
  • Title

    Video-Based Object Recognition Using Novel Set-of-Sets Representations

  • Author

    Yang Liu ; Youngkyoon Jang ; Woontack Woo ; Tae-Kyun Kim

  • Author_Institution
    Imperial Coll. London, London, UK
  • fYear
    2014
  • fDate
    23-28 June 2014
  • Firstpage
    533
  • Lastpage
    540
  • Abstract
    We address the problem of object recognition in egocentric videos, where a user arbitrarily moves a mobile camera around an unknown object. Using a video that captures variation in an object´s appearance owing to camera motion (more viewpoints, scales, clutter and lighting conditions), can accumulate evidence and improve object recognition accuracy. Most previous work has taken a single image as input, or tackled a video simply by a collection i.e. sum of frame-based recognition scores. In this paper, beyond frame-based recognition, we propose two novel set-of-sets representations of a video sequence for object recognition. We combine the techniques of bag of words for a set of data spatially distributed thus heterogeneous, and manifold for a set of data temporally smooth and homogeneous, to construct the two proposed set-of-sets representations. We also propose methods to perform matching for the two representations respectively. The representations and matching techniques are evaluated on our video-based object recognition datasets, which contain 830 videos of ten objects and four environmental variations. The experiments on the challenging new datasets show that our proposed solution significantly outperforms the traditional frame-based methods.
  • Keywords
    cameras; image matching; image motion analysis; image representation; image sequences; object recognition; video signal processing; bag of words techniques; egocentric videos; environmental variations; heterogeneous data; matching techniques; mobile camera motion; object appearance; offrame-based recognition scores; set-of-set representation; spatially distributed data; video sequence; video-based object recognition accuracy; video-based object recognition datasets; Image recognition; Kernel; Manifolds; Object recognition; Trajectory; Vectors; Videos; object recognition; set-of-sets representations; video analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition Workshops (CVPRW), 2014 IEEE Conference on
  • Conference_Location
    Columbus, OH
  • Type

    conf

  • DOI
    10.1109/CVPRW.2014.83
  • Filename
    6910032