• DocumentCode
    394485
  • Title

    The indexing of persons in news sequences using audio-visual data

  • Author

    Albiol, Alberto ; Torres, Luis ; Delp, Edward J.

  • Author_Institution
    Politechnic Univ. of Valencia, Spain
  • Volume
    3
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    We describe a video indexing system that automatically searches for a specific person in a news sequence. The proposed approach combines audio and video confidence values extracted from speaker and face recognition analysis. The system also incorporates a shot selection module that seeks for anchors, where the person on the scene is likely speaking. The system has been extensively tested on several news sequences with very good recognition rates.
  • Keywords
    audio signal processing; face recognition; image classification; image sequences; speaker recognition; video signal processing; audio confidence values; audio-visual data; face recognition; news sequences; person indexing; shot selection module; speaker recognition; video confidence values; video indexing system; Data mining; Degradation; Electronic mail; Face recognition; Gunshot detection systems; Humans; Indexing; Layout; MPEG 7 Standard; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1199126
  • Filename
    1199126