DocumentCode :
3525930
Title :
Robots with language: Multi-label visual recognition using NLP
Author :
Yezhou Yang ; Teo, Ching L. ; Fermuller, Cornelia ; Aloimonos, Yiannis
Author_Institution :
Comput. Vision Lab., Univ. of Maryland, College Park, MD, USA
fYear :
2013
fDate :
6-10 May 2013
Firstpage :
4256
Lastpage :
4262
Abstract :
There has been a recent interest in utilizing contextual knowledge to improve multi-label visual recognition for intelligent agents like robots. Natural Language Processing (NLP) can give us labels, the correlation of labels, and the ontological knowledge about them, so we can automate the acquisition of contextual knowledge. In this paper we show how to use tools from NLP in conjunction with Vision to improve visual recognition. There are two major approaches: First, different language databases organize words according to various semantic concepts. Using these, we can build special purpose databases that can predict the labels involved given a certain context. Here we build a knowledge base for the purpose of describing common daily activities. Second, statistical language tools can provide the correlations of different labels. We show a way to learn a language model from large corpus data that exploits these correlations and propose a general optimization scheme to integrate the language model into the system. Experiments conducted on three multi-label everyday recognition tasks support the effectiveness and efficiency of our approach, with significant gains in recognition accuracies when correlation information is used.
Keywords :
computational linguistics; image recognition; intelligent robots; natural language processing; ontologies (artificial intelligence); robot vision; NLP; contextual knowledge acquisition; corpus data; correlation information; general optimization scheme; intelligent agents; intelligent robots; label correlation; language databases; language model integration; multilabel everyday recognition tasks; multilabel visual recognition; natural language processing; ontological knowledge; recognition accuracy; semantic concepts; special purpose databases; statistical language tools; Accuracy; Correlation; Databases; Hidden Markov models; Knowledge based systems; Robots; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Robotics and Automation (ICRA), 2013 IEEE International Conference on
Conference_Location :
Karlsruhe
ISSN :
1050-4729
Print_ISBN :
978-1-4673-5641-1
Type :
conf
DOI :
10.1109/ICRA.2013.6631179
Filename :
6631179
Link To Document :
بازگشت