DocumentCode :
2399768
Title :
Automatic face naming with caption-based supervision
Author :
Guillaumin, Matthieu ; Mensink, Thomas ; Verbeek, Jakob ; Schmid, Cordelia
Author_Institution :
LEAR team, INRIA, Grenoble
fYear :
2008
fDate :
23-28 June 2008
Firstpage :
1
Lastpage :
8
Abstract :
We consider two scenarios of naming people in databases of news photos with captions: (i) finding faces of a single person, and (ii) assigning names to all faces. We combine an initial text-based step, that restricts the name assigned to a face to the set of names appearing in the caption, with a second step that analyzes visual features of faces. By searching for groups of highly similar faces that can be associated with a name, the results of purely text-based search can be greatly ameliorated. We improve a recent graph-based approach, in which nodes correspond to faces and edges connect highly similar faces. We introduce constraints when optimizing the objective function, and propose improvements in the low-level methods used to construct the graphs. Furthermore, we generalize the graph-based approach to face naming in the full data set. In this multi-person naming case the optimization quickly becomes computationally demanding, and we present an important speed-up using graph-flows to compute the optimal name assignments in documents. Generative models have previously been proposed to solve the multi-person naming task. We compare the generative and graph-based methods in both scenarios, and find significantly better performance using the graph-based methods in both cases.
Keywords :
document handling; visual databases; automatic face naming; caption-based supervision; graph-based approach; multi-person naming task; objective function; purely text-based search; text-based step; Constraint optimization; Digital multimedia broadcasting; Face detection; Multimedia communication; Publishing; Search engines; Spatial databases; Visual databases; Web search; YouTube;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on
Conference_Location :
Anchorage, AK
ISSN :
1063-6919
Print_ISBN :
978-1-4244-2242-5
Electronic_ISBN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2008.4587603
Filename :
4587603
Link To Document :
بازگشت