DocumentCode
1880560
Title
A statistical framework for fusing mid-level perceptual features in news story segmentation
Author
Winston, H. ; Hsu, Winston H M ; Chang, Shih-Fu
Author_Institution
Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
Volume
2
fYear
2003
fDate
6-9 July 2003
Abstract
News story segmentation is essential for video indexing, summarization and intelligence exploitation. In this paper, we present a general statistical framework, called exponential model or maximum entropy model that can systematically select the most significant mid-level features of various types (visual, audio, and semantic) and learn the optimal ways in fusing their combinations in story segmentation. The model utilizes a family of weighted, exponential functions to account for the contributions from different features. The Kullbak-Leibler divergence measure is used in an optimization procedure to iteratively estimate the model parameters, and automatically select the optimal features. The framework is scalable in incorporating new features and adapting to new domains and also discovers how these feature sets contribute to the segmentation work. When tested on foreign news programs, the proposed techniques achieve significant performance improvement over prior work using ad hoc algorithms and slightly better gain over the state of the art using HMM-based models.
Keywords
hidden Markov models; image segmentation; maximum entropy methods; parameter estimation; television broadcasting; video signal processing; ad hoc algorithms; exponential model; hidden Markov model; maximum entropy model; midlevel perceptual features fusing; news story segmentation; parameter estimation; video indexing; video intelligence exploitation; video summarization; Acoustic signal detection; Clustering algorithms; Face detection; Gunshot detection systems; Hidden Markov models; Image segmentation; Indexing; Real time systems; Speech; Statistics;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN
0-7803-7965-9
Type
conf
DOI
10.1109/ICME.2003.1221641
Filename
1221641
Link To Document