• DocumentCode
    1858257
  • Title

    A hidden Markov model framework for video segmentation using audio and image features

  • Author

    Boreczky, J.S. ; Wilcox, Lynn D.

  • Author_Institution
    FX Palo Alto Lab., Palo Alto, CA, USA
  • Volume
    6
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    3741
  • Abstract
    This paper describes a technique for segmenting video using hidden Markov models (HMM). Video is segmented into regions defined by shots, shot boundaries, and camera movement within shots. Features for segmentation include an image-based distance between adjacent video frames, an audio distance based on the acoustic difference in intervals just before and after the frames, and an estimate of motion between the two frames. Typical video segmentation algorithms classify shot boundaries by computing an image-based distance between adjacent frames and comparing this distance to fixed, manually determined thresholds. Motion and audio information is used separately. In contrast, our segmentation technique allows features to be combined within the HMM framework. Further, thresholds are not required since automatically trained HMMs take their place. This algorithm has been tested on a video data base, and has been shown to improve the accuracy of video segmentation over standard threshold-based systems
  • Keywords
    feature extraction; hidden Markov models; image segmentation; image sequences; motion estimation; video signal processing; acoustic difference; adjacent video frames; audio distance; audio features; automatically trained HMM; camera movement; hidden Markov model; image features; image-based distance; motion estimation; shot boundaries classification; shots; threshold-based systems; video data base; video segmentation algorithms; Cameras; Hidden Markov models; Histograms; Image segmentation; Indexing; Laboratories; Motion detection; Motion estimation; System testing; TV;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.679697
  • Filename
    679697