مرکز منطقه ای اطلاع رساني علوم و فناوري - Combined Generative-Discriminative Learning for Object Recognition using Local Image Descriptors

DocumentCode :

2552788

Title :

Combined Generative-Discriminative Learning for Object Recognition using Local Image Descriptors

Author :

Nag, Abhikesh ; Miller, David J. ; Brown, Andrew P. ; Sullivan, Kevin J.

Author_Institution :

Penn State Univ., University Park

fYear :

2007

fDate :

27-29 Aug. 2007

Firstpage :

360

Lastpage :

365

Abstract :

We present a system for scale and affine invariant recognition of vehicular objects in video sequences. We use local descriptors (SIFT keypoints) from image frames to model the object. These features are claimed in the literature to be highly distinctive and invariant to rotation, scale, and affine transformations. However, since the SIFT keypoints that are extracted from an object are instance-specific (variable), they form a dynamic feature space. This presents certain challenges for classification techniques, which generally require use of the same set of features for every instance of an object to be classified. To resolve this difficulty, we associate the extracted keypoints to the components (representative keypoints) in a mixture model for each target class. While the extracted keypoints are variable, the mixture components are fixed. The mixture models the keypoint features, as well as the location and scale at which each keypoint was detected in the frame. Key- point to component association is achieved via a switching optimization procedure that locally maximizes the joint likelihood of keypoints and their locations and scales with the latter based on an affine transformation. To each mixture component from a class, we link a (first layer) support vector machine (SVM) classifier which votes for or against the hypothesis that the keypoint associated to the component belongs to the model´s target class. A second layer SVM pools the votes from the ensemble of SVM classifiers in the first layer and gives the final class decision. We show promising results of experiments for video sequences from the VIVID database.

Keywords :

feature extraction; object recognition; optimisation; support vector machines; vehicles; SIFT keypoint; affine invariant recognition; generative-discriminative learning; local image descriptor; object recognition; scale invariant recognition; support vector machine; switching optimization procedure; vehicular object; Databases; Feature extraction; Image recognition; Object recognition; Support vector machine classification; Support vector machines; Target tracking; Vehicle dynamics; Video sequences; Voting;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Machine Learning for Signal Processing, 2007 IEEE Workshop on

Conference_Location :

Thessaloniki

ISSN :

1551-2541

Print_ISBN :

978-1-4244-1566-3

Electronic_ISBN :

1551-2541

Type :

conf

DOI :

10.1109/MLSP.2007.4414333

Filename :

4414333

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2552788