Abstract :
In this paper, we extend a recently proposed method for generic object detection in images, category-independent object proposals, to the case of video. Given a video, the output of our algorithm is a set of video segments that are likely to contain an object. This can be useful, e.g., as a first step in a video object detection system. Given the sheer amount of pixels in a video, a straightforward extension of the 2D methods to a 3D (spatiotemporal) volume is not feasible. Instead, we start by extracting object proposals in each frame separately. These are linked across frames into object hypotheses, which are then used as higher-order potentials in a graph-based video segmentation framework. Running multiple segmentations and ranking the segments based on the likelihood that they correspond to an object, yields our final set of video object proposals.
Keywords :
graph theory; image segmentation; object detection; video signal processing; 2D methods; category-independent object proposals; generic object detection; graph-based video segmentation framework; higher-order potentials; video object detection system; video object proposals; Image color analysis; Image segmentation; Labeling; Lead; Motion segmentation; Object detection; Proposals;