مرکز منطقه ای اطلاع رساني علوم و فناوري - Compositional Models for Video Event Detection: A Multiple Kernel Learning Latent Variable Approach

DocumentCode :

3423304

Title :

Compositional Models for Video Event Detection: A Multiple Kernel Learning Latent Variable Approach

Author :

Vahdat, A. ; Cannons, Kevin ; Mori, Greg ; Sangmin Oh ; Ilseo Kim

Author_Institution :

Simon Fraser Univ., Burnaby, BC, Canada

fYear :

2013

fDate :

1-8 Dec. 2013

Firstpage :

1185

Lastpage :

1192

Abstract :

We present a compositional model for video event detection. A video is modeled using a collection of both global and segment-level features and kernel functions are employed for similarity comparisons. The locations of salient, discriminative video segments are treated as a latent variable, allowing the model to explicitly ignore portions of the video that are unimportant for classification. A novel, multiple kernel learning (MKL) latent support vector machine (SVM) is defined, that is used to combine and re-weight multiple feature types in a principled fashion while simultaneously operating within the latent variable framework. The compositional nature of the proposed model allows it to respond directly to the challenges of temporal clutter and intra-class variation, which are prevalent in unconstrained internet videos. Experimental results on the TRECVID Multimedia Event Detection 2011 (MED11) dataset demonstrate the efficacy of the method.

Keywords :

clutter; image classification; image segmentation; learning (artificial intelligence); multimedia communication; support vector machines; video signal processing; MKL latent SVM; TRECVID MED11 dataset; TRECVID multimedia event detection 2011 dataset; compositional model; discriminative video segment; global level feature; intraclass variation; kernel function; latent variable framework; multiple-kernel learning latent support vector machine; multiple-kernel learning latent variable approach; salient video segment; segment-level feature; similarity comparison; temporal clutter; unconstrained Internet videos; video classification; video event detection; Computational modeling; Feature extraction; Kernel; Linear programming; Standards; Support vector machines; Training;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision (ICCV), 2013 IEEE International Conference on

Conference_Location :

Sydney, NSW

ISSN :

1550-5499

Type :

conf

DOI :

10.1109/ICCV.2013.463

Filename :

6751257

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3423304