• DocumentCode
    720896
  • Title

    Object detection and depth estimation for 3D trajectory extraction

  • Author

    Boukhers, Zeyd ; Shirahama, Kimiaki ; Li, Frederic ; Grzegorzek, Marcin

  • Author_Institution
    Pattern Recognition Group, Univ. of Siegen, Siegen, Germany
  • fYear
    2015
  • fDate
    10-12 June 2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    To detect an event which is defined by the interaction of objects in a video, it is necessary to capture their spatio-temporal relation. However, the video only displays the original 3D space which is projected onto a 2D image plane. This paper introduces a method which extracts 3D trajectories of objects from 2D videos. Each trajectory represents the transition of an object´s positions in the 3D space. We extract such trajectories by combining object detection with depth estimation that estimates the depth information in 2D videos. The major problem for this is the inconsistency between object detection and depth estimation results. For example, significantly different depths may be estimated for the region of the same object, and an object region that is appropriately shaped by estimated depths may be missed. To overcome this, we first initialise the 3D position of an object by selecting the frame with the highest consistency between the object detection and depth estimation results. Then, we track the object in the 3D space using particle filter, where the 3D position of this object is modelled as a hidden state to generate its 2D visual appearance. Experimental results demonstrate the effectiveness of our method.
  • Keywords
    feature extraction; object detection; object tracking; particle filtering (numerical methods); video retrieval; video signal processing; 2D image plane; 2D videos; 2D visual appearance; 3D position; 3D space; 3D trajectory extraction; depth estimation; event detection; object detection; object positions; object region; object tracking; particle filter; spatio-temporal relation; video objects interaction; video retrieval; Cameras; Estimation; Object detection; Three-dimensional displays; Training; Trajectory; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Content-Based Multimedia Indexing (CBMI), 2015 13th International Workshop on
  • Conference_Location
    Prague
  • Type

    conf

  • DOI
    10.1109/CBMI.2015.7153632
  • Filename
    7153632