Title : 
Reward-driven learning of sensorimotor laws and visual features
         
        
            Author : 
Kleesiek, Jens ; Engel, Andreas K. ; Weber, Cornelius ; Wermter, Stefan
         
        
            Author_Institution : 
Dept. of Neurophysiol. & Pathophysiology, Univ. Med. Center Hamburg-Eppendorf, Hamburg, Germany
         
        
        
        
        
        
            Abstract : 
A frequently reoccurring task of humanoid robots is the autonomous navigation towards a goal position. Here we present a simulation of a purely vision-based docking behavior in a 3-D physical world. The robot learns sensorimotor laws and visual features simultaneously and exploits both for navigation towards its virtual target region. The control laws are trained using a two-layer network consisting of a feature (sensory) layer that feeds into an action (Q-value) layer. A reinforcement feedback signal (delta) modulates not only the action but at the same time the feature weights. Under this influence, the network learns interpretable visual features and assigns goal-directed actions successfully. This is a step towards investigating how reinforcement learning can be linked to visual perception.
         
        
            Keywords : 
humanoid robots; learning (artificial intelligence); navigation; robot vision; 3D physical world; autonomous navigation; humanoid robots; reinforcement feedback signal; reinforcement learning; reward-driven learning; sensorimotor laws; two-layer network; vision-based docking behavior; visual features; visual perception; Visualization;
         
        
        
        
            Conference_Titel : 
Development and Learning (ICDL), 2011 IEEE International Conference on
         
        
            Conference_Location : 
Frankfurt am Main
         
        
        
            Print_ISBN : 
978-1-61284-989-8
         
        
        
            DOI : 
10.1109/DEVLRN.2011.6037358