• DocumentCode
    177576
  • Title
    Multiple-view constrained clustering for unsupervised face identification in TV-broadcast
  • Author
    Bendris, Meriem ; Favre, Benoit ; Charlet, D. ; Damnati, Geraldine ; Auguste, Remi
  • Author_Institution
    Aix Marseille Univ., Marseille, France
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    494
  • Lastpage
    498
  • Abstract
    Our goal is to automatically identify faces in TV broadcasts without a pre-defined dictionary of identities. Most methods are based on identity detection (from OCR and ASR) and require a propagation strategy based on visual clustering. In TV content, people appear with many variations, which makes the clustering difficult. In this case, speaker clustering can be a reliable link for face clustering. In this paper, we propose to automatically build an incomplete speaker-face mapping based on local evidence from OCR and lip-activity links. We then propose schemes for propagating speaker constraints to the face constrained-clustering problem. Experiments performed on the REPERE corpus show that propagating names to face clusters improves face identification (+3.7% F-measure compared to the baseline).
  • Keywords
    face recognition; speaker recognition; television broadcasting; ASR; OCR; REPERE corpus; TV broadcast; TV content; face constrained-clustering problem; identity detection; incomplete speaker-face mapping; lip activity links; multiple view constrained clustering; speaker clustering; speaker constraints propagation; unsupervised face identification; visual clustering; Clustering algorithms; Face; Optical character recognition software; Speech; TV; TV broadcasting; Videos
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type
    conf
  • DOI
    10.1109/ICASSP.2014.6853645
  • Filename
    6853645