Title :
Reward-driven learning of sensorimotor laws and visual features
Author :
Kleesiek, Jens ; Engel, Andreas K. ; Weber, Cornelius ; Wermter, Stefan
Author_Institution :
Dept. of Neurophysiol. & Pathophysiology, Univ. Med. Center Hamburg-Eppendorf, Hamburg, Germany
Abstract :
A frequently reoccurring task of humanoid robots is the autonomous navigation towards a goal position. Here we present a simulation of a purely vision-based docking behavior in a 3-D physical world. The robot learns sensorimotor laws and visual features simultaneously and exploits both for navigation towards its virtual target region. The control laws are trained using a two-layer network consisting of a feature (sensory) layer that feeds into an action (Q-value) layer. A reinforcement feedback signal (delta) modulates not only the action but at the same time the feature weights. Under this influence, the network learns interpretable visual features and assigns goal-directed actions successfully. This is a step towards investigating how reinforcement learning can be linked to visual perception.
Keywords :
humanoid robots; learning (artificial intelligence); navigation; robot vision; 3D physical world; autonomous navigation; humanoid robots; reinforcement feedback signal; reinforcement learning; reward-driven learning; sensorimotor laws; two-layer network; vision-based docking behavior; visual features; visual perception; Visualization;
Conference_Titel :
Development and Learning (ICDL), 2011 IEEE International Conference on
Conference_Location :
Frankfurt am Main
Print_ISBN :
978-1-61284-989-8
DOI :
10.1109/DEVLRN.2011.6037358