DocumentCode
350278
Title
Detecting news reporting using audio/visual information
Author
Liu, Zhu ; Huang, Qian
Author_Institution
Dept. of Res., AT&T Labs., Red Bank, NJ, USA
Volume
3
fYear
1999
fDate
1999
Firstpage
324
Abstract
This paper proposes an integrated approach to discriminate news reporting from everything else in broadcast news data based on both audio and visual information. The separation of news reporting segments from others not only can provide useful indices for video streams but also serves as a pre-processing step for tasks such as speaker identification and speech recognition so that only speech segments are passed on for further processing. A set of audio and visual features are adopted that aim at capturing the intrinsic properties of the underlying classes. Four types of classifiers (threshold fuzzy, Gaussian Mixture Model based, and Support Vector Machine) are tested. Some of the experimental results are presented and discussed in the paper
Keywords
speaker recognition; video databases; video signal processing; broadcast news data; news reporting; speaker identification; speech recognition; video streams; Automatic speech recognition; Broadcasting; Drives; Frequency; Multimedia communication; Speech recognition; Statistics; Support vector machine classification; Support vector machines; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Image Processing, 1999. ICIP 99. Proceedings. 1999 International Conference on
Conference_Location
Kobe
Print_ISBN
0-7803-5467-2
Type
conf
DOI
10.1109/ICIP.1999.817128
Filename
817128
Link To Document