• DocumentCode
    2399768
  • Title

    Automatic face naming with caption-based supervision

  • Author

    Guillaumin, Matthieu ; Mensink, Thomas ; Verbeek, Jakob ; Schmid, Cordelia

  • Author_Institution
    LEAR team, INRIA, Grenoble
  • fYear
    2008
  • fDate
    23-28 June 2008
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    We consider two scenarios of naming people in databases of news photos with captions: (i) finding faces of a single person, and (ii) assigning names to all faces. We combine an initial text-based step, that restricts the name assigned to a face to the set of names appearing in the caption, with a second step that analyzes visual features of faces. By searching for groups of highly similar faces that can be associated with a name, the results of purely text-based search can be greatly ameliorated. We improve a recent graph-based approach, in which nodes correspond to faces and edges connect highly similar faces. We introduce constraints when optimizing the objective function, and propose improvements in the low-level methods used to construct the graphs. Furthermore, we generalize the graph-based approach to face naming in the full data set. In this multi-person naming case the optimization quickly becomes computationally demanding, and we present an important speed-up using graph-flows to compute the optimal name assignments in documents. Generative models have previously been proposed to solve the multi-person naming task. We compare the generative and graph-based methods in both scenarios, and find significantly better performance using the graph-based methods in both cases.
  • Keywords
    document handling; visual databases; automatic face naming; caption-based supervision; graph-based approach; multi-person naming task; objective function; purely text-based search; text-based step; Constraint optimization; Digital multimedia broadcasting; Face detection; Multimedia communication; Publishing; Search engines; Spatial databases; Visual databases; Web search; YouTube;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on
  • Conference_Location
    Anchorage, AK
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4244-2242-5
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2008.4587603
  • Filename
    4587603