DocumentCode :
350278
Title :
Detecting news reporting using audio/visual information
Author :
Liu, Zhu ; Huang, Qian
Author_Institution :
Dept. of Res., AT&T Labs., Red Bank, NJ, USA
Volume :
3
fYear :
1999
fDate :
1999
Firstpage :
324
Abstract :
This paper proposes an integrated approach to discriminate news reporting from everything else in broadcast news data based on both audio and visual information. The separation of news reporting segments from others not only can provide useful indices for video streams but also serves as a pre-processing step for tasks such as speaker identification and speech recognition so that only speech segments are passed on for further processing. A set of audio and visual features are adopted that aim at capturing the intrinsic properties of the underlying classes. Four types of classifiers (threshold fuzzy, Gaussian Mixture Model based, and Support Vector Machine) are tested. Some of the experimental results are presented and discussed in the paper
Keywords :
speaker recognition; video databases; video signal processing; broadcast news data; news reporting; speaker identification; speech recognition; video streams; Automatic speech recognition; Broadcasting; Drives; Frequency; Multimedia communication; Speech recognition; Statistics; Support vector machine classification; Support vector machines; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Processing, 1999. ICIP 99. Proceedings. 1999 International Conference on
Conference_Location :
Kobe
Print_ISBN :
0-7803-5467-2
Type :
conf
DOI :
10.1109/ICIP.1999.817128
Filename :
817128
Link To Document :
بازگشت