Title : 
Multi-scale Q-learning of a mobile robot in dynamic environments
         
        
            Author : 
Takase, Norio ; Kubota, Naoyuki ; Baba, N.
         
        
            Author_Institution : 
Tokyo Metropolitan Univ., Tokyo, Japan
         
        
        
        
        
        
            Abstract : 
This paper deals with a state-dependent learning method of a mobile robot in dynamic and unknown environments. The aim of a mobile robot is to find the optimal path in the task of maze navigation on a grid world. Various types of reinforcement learning methods have been proposed, but it is very difficult to design the granularity (resolution) of states in search space. Therefore, we propose a multi-scale value function to enhance the initial learning of reinforcement learning. First, we compare the performance of temporal difference (TD) learning and Q-learning in dynamic environment. Here we assume several obstacles disappear in the grid world with an existence probability. Several experimental results show the effectiveness of the proposed method.
         
        
            Keywords : 
collision avoidance; learning (artificial intelligence); mobile robots; probability; TD learning; existence probability; granularity design; grid world; maze navigation; mobile robot; multiscale Q-learning; multiscale value function; obstacle avoidance; reinforcement learning method; state-dependent learning method; temporal difference learning; Dynamic Environments; Intelligent Robitics; Multi-scale Value Function; Reinforcement Learning;
         
        
        
        
            Conference_Titel : 
Soft Computing and Intelligent Systems (SCIS) and 13th International Symposium on Advanced Intelligent Systems (ISIS), 2012 Joint 6th International Conference on
         
        
            Conference_Location : 
Kobe
         
        
            Print_ISBN : 
978-1-4673-2742-8
         
        
        
            DOI : 
10.1109/SCIS-ISIS.2012.6505358