مرکز منطقه ای اطلاع رساني علوم و فناوري - Video Event Detection by Inferring Temporal Instance Labels

DocumentCode :

254049

Title :

Video Event Detection by Inferring Temporal Instance Labels

Author :

Kuan-Ting Lai ; Yu, Felix X. ; Ming-Syan Chen ; Shih-Fu Chang

Author_Institution :

Dept. of Electr. Eng., Columbia Univ., New York, NY, USA

fYear :

2014

fDate :

23-28 June 2014

Firstpage :

2251

Lastpage :

2258

Abstract :

Video event detection allows intelligent indexing of video content based on events. Traditional approaches extract features from video frames or shots, then quantize and pool the features to form a single vector representation for the entire video. Though simple and efficient, the final pooling step may lead to loss of temporally local information, which is important in indicating which part in a long video signifies presence of the event. In this work, we propose a novel instance-based video event detection approach. We represent each video as multiple ´instances´, defined as video segments of different temporal intervals. The objective is to learn an instance-level event detection model based on only video-level labels. To solve this problem, we propose a large-margin formulation which treats the instance labels as hidden latent variables, and simultaneously infers the instance labels as well as the instance-level classification model. Our framework infers optimal solutions that assume positive videos have a large number of positive instances while negative videos have the fewest ones. Extensive experiments on large-scale video event datasets demonstrate significant performance gains. The proposed method is also useful in explaining the detection results by localizing the temporal segments in a video which is responsible for the positive detection.

Keywords :

feature extraction; image classification; learning (artificial intelligence); video signal processing; MIL; feature extraction; instance-level classification model; large-margin formulation; multiple-instance learning; temporal instance labels; temporal segments; video content indexing; video event detection; Event detection; Feature extraction; Linear programming; Optimization; Predictive models; Support vector machines; Vectors; Multiple Instance Learning; Proportion SVM; Video Event Detection;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on

Conference_Location :

Columbus, OH

Type :

conf

DOI :

10.1109/CVPR.2014.288

Filename :

6909685

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=254049