DocumentCode :
2915590
Title :
Baby talk: Understanding and generating simple image descriptions
Author :
Kulkarni, Girish ; Premraj, Visruth ; Dhar, Sagnik ; Li, Siming ; Choi, Yejin ; Berg, Alexander C. ; Berg, Tamara L.
Author_Institution :
Stony Brook Univ., Stony Brook, NY, USA
fYear :
2011
fDate :
20-25 June 2011
Firstpage :
1601
Lastpage :
1608
Abstract :
We posit that visually descriptive language offers computer vision researchers both information about the world, and information about how people describe the world. The potential benefit from this source is made more significant due to the enormous amount of language data easily available today. We present a system to automatically generate natural language descriptions from images that exploits both statistics gleaned from parsing large quantities of text data and recognition algorithms from computer vision. The system is very effective at producing relevant sentences for images. It also generates descriptions that are notably more true to the specific image content than previous work.
Keywords :
computer vision; natural language processing; statistics; text analysis; visual languages; baby talk; computer vision researchers; image description generation; image description understanding; natural language description generation; statistics; text data parsing; visually descriptive language; Computer vision; Detectors; Image recognition; Labeling; Natural languages; Object detection; Visualization;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on
Conference_Location :
Providence, RI
ISSN :
1063-6919
Print_ISBN :
978-1-4577-0394-2
Type :
conf
DOI :
10.1109/CVPR.2011.5995466
Filename :
5995466