DocumentCode
699590
Title
Retrieving objects from videos based on affine regions
Author
Ferrari, Vittorio ; Tuytelaars, Tinne ; Van Gool, Luc
Author_Institution
Comput. Vision Group (BIWI), ETH Zurich, Zurich, Switzerland
fYear
2004
fDate
6-10 Sept. 2004
Firstpage
1733
Lastpage
1736
Abstract
We present a method to (semi-)automatically annotate video material. More precisely, we focus on recognizing specific objects and scenes in keyframes. Objects are learnt simply by having the user delineate them in one (or a few) images. The basic building block to achieve this goal consists of affine invariant regions. These are local image patches that adapt their shape based on the image content so as to be invariant to viewpoint changes. Instead of simply matching the regions and counting the number of matches, we propose to gather more evidence about the presence of the object by exploring the image around the initial matches. This boosts the performance, especially under difficult, real-world imaging conditions. Experimental results on news broadcast data demonstrate the viability of the approach.
Keywords
image matching; object recognition; video retrieval; video signal processing; affine invariant regions; automatic video material annotation; image content; local image patches; object recognition; object retrieval; real-world imaging conditions; scene recognition; Abstracts; Painting;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2004 12th European
Conference_Location
Vienna
Print_ISBN
978-320-0001-65-7
Type
conf
Filename
7080120
Link To Document