DocumentCode
2173506
Title
Using temporal coherence to build models of animals
Author
Ramanan, Deva ; Forsyth, D.A.
Author_Institution
Comput. Sci. Div., California Univ., Berkeley, CA, USA
fYear
2003
fDate
13-16 Oct. 2003
Firstpage
338
Abstract
We describe a system that can build appearance models of animals automatically from a video sequence of the relevant animal with no explicit supervisory information. The video sequence need not have any form of special background. Animals are modeled as a 2D kinematic chain of rectangular segments, where the number of segments and the topology of the chain are unknown. The system detects possible segments, clusters segments whose appearance is coherent over time, and then builds a spatial model of such segment clusters. The resulting representation of the spatial configuration of the animal in each frame can be seen either as a track - in which case the system described should be viewed as a generalized tracker, that is capable of modeling objects while tracking them - or as the source of an appearance model which can be used to build detectors for the particular animal. This is because knowing a video sequence is temporally coherent - i.e. that a particular animal is present through the sequence - is a strong supervisory signal. The method is shown to be successful as a tracker on video sequences of real scenes showing three different animals. For the same reason it is successful as a tracker, the method results in detectors that can be used to find each animal fairly reliably within the Corel collection of images.
Keywords
image representation; image segmentation; image sequences; object detection; pattern clustering; realistic images; spatiotemporal phenomena; tracking; 2D kinematic chain; Corel image collection; animal appearance models; object modeling; object tracking; real scenes; rectangular segments; segment clusters; spatial model; supervisory information; temporal coherence; video sequence; Animals; Coherence; Computer science; Detectors; Kinematics; Layout; Object detection; Region 4; Topology; Video sequences;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on
Conference_Location
Nice, France
Print_ISBN
0-7695-1950-4
Type
conf
DOI
10.1109/ICCV.2003.1238364
Filename
1238364
Link To Document