• DocumentCode
    1880560
  • Title

    A statistical framework for fusing mid-level perceptual features in news story segmentation

  • Author

    Winston, H. ; Hsu, Winston H M ; Chang, Shih-Fu

  • Author_Institution
    Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
  • Volume
    2
  • fYear
    2003
  • fDate
    6-9 July 2003
  • Abstract
    News story segmentation is essential for video indexing, summarization and intelligence exploitation. In this paper, we present a general statistical framework, called exponential model or maximum entropy model that can systematically select the most significant mid-level features of various types (visual, audio, and semantic) and learn the optimal ways in fusing their combinations in story segmentation. The model utilizes a family of weighted, exponential functions to account for the contributions from different features. The Kullbak-Leibler divergence measure is used in an optimization procedure to iteratively estimate the model parameters, and automatically select the optimal features. The framework is scalable in incorporating new features and adapting to new domains and also discovers how these feature sets contribute to the segmentation work. When tested on foreign news programs, the proposed techniques achieve significant performance improvement over prior work using ad hoc algorithms and slightly better gain over the state of the art using HMM-based models.
  • Keywords
    hidden Markov models; image segmentation; maximum entropy methods; parameter estimation; television broadcasting; video signal processing; ad hoc algorithms; exponential model; hidden Markov model; maximum entropy model; midlevel perceptual features fusing; news story segmentation; parameter estimation; video indexing; video intelligence exploitation; video summarization; Acoustic signal detection; Clustering algorithms; Face detection; Gunshot detection systems; Hidden Markov models; Image segmentation; Indexing; Real time systems; Speech; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
  • Print_ISBN
    0-7803-7965-9
  • Type

    conf

  • DOI
    10.1109/ICME.2003.1221641
  • Filename
    1221641