DocumentCode :
250362
Title :
Learning spatial-semantic representations from natural language descriptions and scene classifications
Author :
Hemachandra, Sachithra ; Walter, Matthew R. ; Tellex, Stefanie ; Teller, Seth
Author_Institution :
Comput. Sci. & Artificial Intell. Lab., Massachusetts Inst. of Technol., Cambridge, MA, USA
fYear :
2014
fDate :
May 31 2014-June 7 2014
Firstpage :
2623
Lastpage :
2630
Abstract :
We describe a semantic mapping algorithm that learns human-centric environment models by interpreting natural language utterances. Underlying the approach is a coupled metric, topological, and semantic representation of the environment that enables the method to fuse information from natural language descriptions with low-level metric and appearance data. We extend earlier work with a novel formulation that incorporates spatial layout into a topological representation of the environment. We also describe a factor graph formulation of the semantic properties that encodes human-centric concepts such as type and colloquial name for each mapped region. The algorithm infers these properties by combining the user´s natural language descriptions with image- and laser-based scene classification. We also propose a mechanism to more effectively ground natural language descriptions of distant regions using semantic cues from other modalities. We describe how the algorithm employs this learned semantic information to propose valid topological hypotheses, leading to more accurate topological and metric maps. We demonstrate that integrating language with other sensor data increases the accuracy of the achieved spatial-semantic representation of the environment.
Keywords :
control engineering computing; graph theory; human-robot interaction; image classification; learning (artificial intelligence); natural language processing; robot vision; semantic networks; appearance data; factor graph formulation; human-centric environment models; image-based scene classification; laser-based scene classification; learning; low-level metric data; mapped region; metric representation; natural language descriptions; natural language utterances; semantic cues; semantic mapping algorithm; semantic properties; spatial layout; spatial-semantic representations; topological hypotheses; topological representation; Laser modes; Measurement; Natural languages; Robot sensing systems; Semantics; Topology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Robotics and Automation (ICRA), 2014 IEEE International Conference on
Conference_Location :
Hong Kong
Type :
conf
DOI :
10.1109/ICRA.2014.6907235
Filename :
6907235
Link To Document :
بازگشت