Title :
Robots with language: Multi-label visual recognition using NLP
Author :
Yezhou Yang ; Teo, Ching L. ; Fermuller, Cornelia ; Aloimonos, Yiannis
Author_Institution :
Comput. Vision Lab., Univ. of Maryland, College Park, MD, USA
Abstract :
There has been a recent interest in utilizing contextual knowledge to improve multi-label visual recognition for intelligent agents like robots. Natural Language Processing (NLP) can give us labels, the correlation of labels, and the ontological knowledge about them, so we can automate the acquisition of contextual knowledge. In this paper we show how to use tools from NLP in conjunction with Vision to improve visual recognition. There are two major approaches: First, different language databases organize words according to various semantic concepts. Using these, we can build special purpose databases that can predict the labels involved given a certain context. Here we build a knowledge base for the purpose of describing common daily activities. Second, statistical language tools can provide the correlations of different labels. We show a way to learn a language model from large corpus data that exploits these correlations and propose a general optimization scheme to integrate the language model into the system. Experiments conducted on three multi-label everyday recognition tasks support the effectiveness and efficiency of our approach, with significant gains in recognition accuracies when correlation information is used.
Keywords :
computational linguistics; image recognition; intelligent robots; natural language processing; ontologies (artificial intelligence); robot vision; NLP; contextual knowledge acquisition; corpus data; correlation information; general optimization scheme; intelligent agents; intelligent robots; label correlation; language databases; language model integration; multilabel everyday recognition tasks; multilabel visual recognition; natural language processing; ontological knowledge; recognition accuracy; semantic concepts; special purpose databases; statistical language tools; Accuracy; Correlation; Databases; Hidden Markov models; Knowledge based systems; Robots; Visualization;
Conference_Titel :
Robotics and Automation (ICRA), 2013 IEEE International Conference on
Conference_Location :
Karlsruhe
Print_ISBN :
978-1-4673-5641-1
DOI :
10.1109/ICRA.2013.6631179