Retrieving objects from videos based on affine regions

Author

Ferrari, Vittorio ; Tuytelaars, Tinne ; Van Gool, Luc

Author_Institution

Comput. Vision Group (BIWI), ETH Zurich, Zurich, Switzerland

fYear

2004

fDate

6-10 Sept. 2004

Firstpage

1733

Lastpage

1736

Abstract

We present a method to (semi-)automatically annotate video material. More precisely, we focus on recognizing specific objects and scenes in keyframes. Objects are learnt simply by having the user delineate them in one (or a few) images. The basic building block to achieve this goal consists of affine invariant regions. These are local image patches that adapt their shape based on the image content so as to be invariant to viewpoint changes. Instead of simply matching the regions and counting the number of matches, we propose to gather more evidence about the presence of the object by exploring the image around the initial matches. This boosts the performance, especially under difficult, real-world imaging conditions. Experimental results on news broadcast data demonstrate the viability of the approach.

Keywords

image matching; object recognition; video retrieval; video signal processing; affine invariant regions; automatic video material annotation; image content; local image patches; object recognition; object retrieval; real-world imaging conditions; scene recognition; Abstracts; Painting;

fLanguage

English

Publisher

ieee

Conference_Titel

Signal Processing Conference, 2004 12th European

Conference_Location

Vienna

Print_ISBN

978-320-0001-65-7

Type

conf

Filename

7080120