• DocumentCode
    3087452
  • Title

    Association of Audio and Video Segmentations for Automatic Person Indexing

  • Author

    El Khoury, Elie ; Jaffré, Gaël ; Pinquier, Julien ; Sénac, Christine

  • Author_Institution
    Paul Sabatier Univ., Toulouse
  • fYear
    2007
  • fDate
    25-27 June 2007
  • Firstpage
    287
  • Lastpage
    294
  • Abstract
    In the audiovisual indexing context, we propose and experiment a method that, from an audio speaker segmentation and a video costume segmentation made on an audiovisual document, makes an automatic association between each voice and the images containing the corresponding visual person. This association can be used as a preprocessing step for existing applications like person identification systems. The first step consists in fusing, without any a priori knowledge, the two indexes produced by audio and video segmentations, in order to make the information brought by each of them more robust. Evaluation is done on a corpus composed of French TV broadcasts. If both audio and video streams are correctly segmented, this automatic association yields excellent results. When the two streams are oversegmented, our system permits to detect the main persons in term of duration of appearance.
  • Keywords
    audio signal processing; biometrics (access control); document image processing; image recognition; image segmentation; indexing; speaker recognition; video signal processing; French TV broadcast; audio segmentation; audio speaker segmentation; audiovisual document analysis; audiovisual indexing; automatic person indexing; person identification system; video costume segmentation; video segmentation; Automatic speech recognition; Bayesian methods; Face detection; Face recognition; Image segmentation; Indexing; Loudspeakers; Robustness; Streaming media; TV broadcasting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Content-Based Multimedia Indexing, 2007. CBMI '07. International Workshop on
  • Conference_Location
    Bordeaux
  • Print_ISBN
    1-4244-1011-8
  • Electronic_ISBN
    1-4244-1011-8
  • Type

    conf

  • DOI
    10.1109/CBMI.2007.385424
  • Filename
    4275087