مرکز منطقه ای اطلاع رساني علوم و فناوري - Detecting news reporting using audio/visual information

DocumentCode :

350278

Title :

Detecting news reporting using audio/visual information

Author :

Liu, Zhu ; Huang, Qian

Author_Institution :

Dept. of Res., AT&T Labs., Red Bank, NJ, USA

Volume :

fYear :

1999

fDate :

1999

Firstpage :

324

Abstract :

This paper proposes an integrated approach to discriminate news reporting from everything else in broadcast news data based on both audio and visual information. The separation of news reporting segments from others not only can provide useful indices for video streams but also serves as a pre-processing step for tasks such as speaker identification and speech recognition so that only speech segments are passed on for further processing. A set of audio and visual features are adopted that aim at capturing the intrinsic properties of the underlying classes. Four types of classifiers (threshold fuzzy, Gaussian Mixture Model based, and Support Vector Machine) are tested. Some of the experimental results are presented and discussed in the paper

Keywords :

speaker recognition; video databases; video signal processing; broadcast news data; news reporting; speaker identification; speech recognition; video streams; Automatic speech recognition; Broadcasting; Drives; Frequency; Multimedia communication; Speech recognition; Statistics; Support vector machine classification; Support vector machines; Testing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Image Processing, 1999. ICIP 99. Proceedings. 1999 International Conference on

Conference_Location :

Kobe

Print_ISBN :

0-7803-5467-2

Type :

conf

DOI :

10.1109/ICIP.1999.817128

Filename :

817128

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=350278