DocumentCode :
944809
Title :
Selecting Salient Frames for Spatiotemporal Video Modeling and Segmentation
Author :
Song, Xiaomu ; Fan, Guoliang
Author_Institution :
Oklahoma State Univ., Stillwater
Volume :
16
Issue :
12
fYear :
2007
Firstpage :
3035
Lastpage :
3046
Abstract :
We propose a new statistical generative model for spatiotemporal video segmentation. The objective is to partition a video sequence into homogeneous segments that can be used as "building blocks" for semantic video segmentation. The baseline framework is a Gaussian mixture model (GMM)-based video modeling approach that involves a six-dimensional spatiotemporal feature space. Specifically, we introduce the concept of frame saliency to quantify the relevancy of a video frame to the GMM-based spatiotemporal video modeling. This helps us use a small set of salient frames to facilitate the model training by reducing data redundancy and irrelevance. A modified expectation maximization algorithm is developed for simultaneous GMM training and frame saliency estimation, and the frames with the highest saliency values are extracted to refine the GMM estimation for video segmentation. Moreover, it is interesting to find that frame saliency can imply some object behaviors. This makes the proposed method also applicable to other frame-related video analysis tasks, such as key-frame extraction, video skimming, etc. Experiments on real videos demonstrate the effectiveness and efficiency of the proposed method.
Keywords :
Gaussian processes; expectation-maximisation algorithm; feature extraction; image segmentation; image sequences; statistical analysis; video signal processing; GMM training; Gaussian mixture model; expectation maximization algorithm; feature selection; frame saliency estimation; semantic video segmentation; spatiotemporal video modeling; spatiotemporal video segmentation; statistical generative model; video analysis; video sequence; Expectation maximization (EM); Gaussian mixture models (GMMs); feature selection; frame saliency; statistical video modeling; video segmentation; Algorithms; Artificial Intelligence; Computer Simulation; Data Interpretation, Statistical; Image Enhancement; Image Interpretation, Computer-Assisted; Models, Statistical; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Subtraction Technique; Video Recording;
fLanguage :
English
Journal_Title :
Image Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1057-7149
Type :
jour
DOI :
10.1109/TIP.2007.908283
Filename :
4358842
Link To Document :
بازگشت