• DocumentCode
    350278
  • Title

    Detecting news reporting using audio/visual information

  • Author

    Liu, Zhu ; Huang, Qian

  • Author_Institution
    Dept. of Res., AT&T Labs., Red Bank, NJ, USA
  • Volume
    3
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    324
  • Abstract
    This paper proposes an integrated approach to discriminate news reporting from everything else in broadcast news data based on both audio and visual information. The separation of news reporting segments from others not only can provide useful indices for video streams but also serves as a pre-processing step for tasks such as speaker identification and speech recognition so that only speech segments are passed on for further processing. A set of audio and visual features are adopted that aim at capturing the intrinsic properties of the underlying classes. Four types of classifiers (threshold fuzzy, Gaussian Mixture Model based, and Support Vector Machine) are tested. Some of the experimental results are presented and discussed in the paper
  • Keywords
    speaker recognition; video databases; video signal processing; broadcast news data; news reporting; speaker identification; speech recognition; video streams; Automatic speech recognition; Broadcasting; Drives; Frequency; Multimedia communication; Speech recognition; Statistics; Support vector machine classification; Support vector machines; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing, 1999. ICIP 99. Proceedings. 1999 International Conference on
  • Conference_Location
    Kobe
  • Print_ISBN
    0-7803-5467-2
  • Type

    conf

  • DOI
    10.1109/ICIP.1999.817128
  • Filename
    817128