Regional Information Center for Science and Technology - Selecting Salient Frames for Spatiotemporal Video Modeling and Segmentation

DocumentCode :

944809

Title :

Selecting Salient Frames for Spatiotemporal Video Modeling and Segmentation

Author :

Song, Xiaomu ; Fan, Guoliang

Author_Institution :

Oklahoma State Univ., Stillwater

Volume :

Issue :

Year :

2007

Firstpage :

3035

Lastpage :

3046

Abstract :

We propose a new statistical generative model for spatiotemporal video segmentation. The objective is to partition a video sequence into homogeneous segments that can be used as "building blocks" for semantic video segmentation. The baseline framework is a Gaussian mixture model (GMM)-based video modeling approach that involves a six-dimensional spatiotemporal feature space. Specifically, we introduce the concept of frame saliency to quantify the relevance of a video frame to GMM-based spatiotemporal video modeling. This lets us use a small set of salient frames to facilitate model training by reducing data redundancy and irrelevance. A modified expectation maximization algorithm is developed for simultaneous GMM training and frame saliency estimation, and the frames with the highest saliency values are extracted to refine the GMM estimation for video segmentation. Moreover, we find that frame saliency can indicate some object behaviors, which makes the proposed method applicable to other frame-related video analysis tasks, such as key-frame extraction and video skimming. Experiments on real videos demonstrate the effectiveness and efficiency of the proposed method.
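
The baseline idea in the abstract (a GMM fitted over a six-dimensional spatiotemporal feature space) can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not say which six features are used, so the sketch assumes pixel position (x, y), frame index t, and three color channels. It runs a standard EM loop rather than the modified EM described above, and the per-frame score (mean log-likelihood under the fitted GMM) is only a hypothetical stand-in for the paper's frame saliency. All function names are illustrative.

```python
import numpy as np

def video_to_features(video):
    """Map a (T, H, W, 3) video to standardized [x, y, t, c1, c2, c3] rows.

    The choice of these six dimensions is an assumption, not from the abstract.
    Also returns the frame index of every row.
    """
    T, H, W, _ = video.shape
    t, y, x = np.meshgrid(np.arange(T), np.arange(H), np.arange(W), indexing="ij")
    coords = np.stack([x, y, t], axis=-1).reshape(-1, 3).astype(float)
    colors = video.reshape(-1, 3).astype(float)
    feats = np.hstack([coords, colors])
    std = feats.std(axis=0)
    std[std == 0] = 1.0  # avoid dividing by zero for constant dimensions
    return (feats - feats.mean(axis=0)) / std, t.reshape(-1)

def log_gauss(X, mean, cov):
    """Log-density of a full-covariance Gaussian at each row of X."""
    d = X.shape[1]
    chol = np.linalg.cholesky(cov)
    sol = np.linalg.solve(chol, (X - mean).T)  # shape (d, N)
    maha = np.sum(sol ** 2, axis=0)
    log_det = 2.0 * np.sum(np.log(np.diag(chol)))
    return -0.5 * (d * np.log(2 * np.pi) + log_det + maha)

def component_log_probs(X, weights, means, covs):
    """Log of weight_j * N(x | mean_j, cov_j) for every row and component."""
    return np.stack(
        [np.log(weights[j]) + log_gauss(X, means[j], covs[j])
         for j in range(len(weights))],
        axis=1,
    )

def fit_gmm(X, k, iters=30, seed=0, reg=1e-6):
    """Standard EM for a GMM (not the modified EM from the paper)."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    means = X[rng.choice(n, size=k, replace=False)]
    covs = np.stack([np.cov(X.T) + reg * np.eye(d)] * k)
    weights = np.full(k, 1.0 / k)
    for _ in range(iters):
        # E-step: posterior responsibility of each component for each sample.
        log_p = component_log_probs(X, weights, means, covs)
        log_norm = np.logaddexp.reduce(log_p, axis=1, keepdims=True)
        resp = np.exp(log_p - log_norm)
        # M-step: re-estimate weights, means, and covariances.
        nk = resp.sum(axis=0) + 1e-12
        weights = nk / n
        means = (resp.T @ X) / nk[:, None]
        for j in range(k):
            diff = X - means[j]
            covs[j] = (resp[:, j, None] * diff).T @ diff / nk[j] + reg * np.eye(d)
    return weights, means, covs

def frame_scores(X, frame_idx, weights, means, covs):
    """Mean log-likelihood per frame: a hypothetical proxy, not the paper's saliency."""
    ll = np.logaddexp.reduce(component_log_probs(X, weights, means, covs), axis=1)
    n_frames = int(frame_idx.max()) + 1
    return np.array([ll[frame_idx == f].mean() for f in range(n_frames)])

# Toy usage on random data standing in for a real video.
rng = np.random.default_rng(0)
video = rng.random((4, 8, 8, 3))
X, frame_idx = video_to_features(video)
weights, means, covs = fit_gmm(X, k=3, iters=10)
scores = frame_scores(X, frame_idx, weights, means, covs)
top_frames = np.argsort(scores)[::-1][:2]  # the two highest-scoring frames
```

In this sketch, frames would be ranked by `scores` and the top few used to refit the model, which loosely mirrors the "select salient frames, then refine the GMM" flow described in the abstract.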

Keywords :

Gaussian processes; expectation-maximisation algorithm; feature extraction; image segmentation; image sequences; statistical analysis; video signal processing; GMM training; Gaussian mixture model; expectation maximization algorithm; feature selection; frame saliency estimation; semantic video segmentation; spatiotemporal video modeling; spatiotemporal video segmentation; statistical generative model; video analysis; video sequence; Expectation maximization (EM); Gaussian mixture models (GMMs); frame saliency; statistical video modeling; video segmentation; Algorithms; Artificial Intelligence; Computer Simulation; Data Interpretation, Statistical; Image Enhancement; Image Interpretation, Computer-Assisted; Models, Statistical; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Subtraction Technique; Video Recording

Language :

English

Journal_Title :

Image Processing, IEEE Transactions on

Publisher :

IEEE

ISSN :

1057-7149

Type :

jour

DOI :

10.1109/TIP.2007.908283

Filename :

4358842

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=944809