• DocumentCode
    944809
  • Title

    Selecting Salient Frames for Spatiotemporal Video Modeling and Segmentation

  • Author

    Song, Xiaomu ; Fan, Guoliang

  • Author_Institution
    Oklahoma State Univ., Stillwater
  • Volume
    16
  • Issue
    12
  • fYear
    2007
  • Firstpage
    3035
  • Lastpage
    3046
  • Abstract
    We propose a new statistical generative model for spatiotemporal video segmentation. The objective is to partition a video sequence into homogeneous segments that can be used as "building blocks" for semantic video segmentation. The baseline framework is a Gaussian mixture model (GMM)-based video modeling approach that involves a six-dimensional spatiotemporal feature space. Specifically, we introduce the concept of frame saliency to quantify the relevancy of a video frame to the GMM-based spatiotemporal video modeling. This helps us use a small set of salient frames to facilitate the model training by reducing data redundancy and irrelevance. A modified expectation maximization algorithm is developed for simultaneous GMM training and frame saliency estimation, and the frames with the highest saliency values are extracted to refine the GMM estimation for video segmentation. Moreover, it is interesting to find that frame saliency can imply some object behaviors. This makes the proposed method also applicable to other frame-related video analysis tasks, such as key-frame extraction, video skimming, etc. Experiments on real videos demonstrate the effectiveness and efficiency of the proposed method.
  • Keywords
    Gaussian processes; expectation-maximisation algorithm; feature extraction; image segmentation; image sequences; statistical analysis; video signal processing; GMM training; Gaussian mixture model; expectation maximization algorithm; feature selection; frame saliency estimation; semantic video segmentation; spatiotemporal video modeling; spatiotemporal video segmentation; statistical generative model; video analysis; video sequence; Expectation maximization (EM); Gaussian mixture models (GMMs); feature selection; frame saliency; statistical video modeling; video segmentation; Algorithms; Artificial Intelligence; Computer Simulation; Data Interpretation, Statistical; Image Enhancement; Image Interpretation, Computer-Assisted; Models, Statistical; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Subtraction Technique; Video Recording;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2007.908283
  • Filename
    4358842