Title : 
Learning strategy fusion to acquire dynamic motion
         
        
            Author : 
Yamaguchi, Akihiko ; Takamatsu, Jun ; Ogasawara, Tsukasa
         
        
            Author_Institution : 
Grad. Sch. of Inf. Sci. Nara, Inst. of Sci. & Technol., Ikoma, Japan
         
        
        
        
        
        
            Abstract : 
This paper proposes a method to fuse learning strategies (LSs) in a reinforcement learning framework. In this method, some LSs are integrated for learning a single task of a single robot. The LSs consists of (1) LS-scratch: learning a policy from scratch, (2) LS-accelerating: learning a policy from a previously learned policy by accelerating motion speed parameters, and (3) LS-freeing: learning a policy from a previously learned policy by increasing the DoF (degree of freedom). The proposed LS fusion method enables (A) in the early stage of learning, LS fusion can select a suitable DoF configuration from a predefined set of DoF configurations, and (B) after a behavior module that learns from scratch converges, the LSs are applied to improve the policy. As a result, a robot can learn a complex task by starting with a simplified configuration, and then transferring the learned behaviors while increasing the difficulty. We introduce WF-DCOB proposed by Yamaguchi et al. for the LSs. We verify the proposed LS fusion method with a crawling task of a humanoid robot. The simulation experiments demonstrate the advantage of the proposed method compared to learning with a single learning module.
         
        
            Keywords : 
learning (artificial intelligence); robots; DoF configurations; LS fusion method; acquire dynamic motion; degree of freedom; learning strategy fusion; motion speed parameter; reinforcement learning; Acceleration; Function approximation; Humanoid robots; Joints; Learning; Vectors;
         
        
        
        
            Conference_Titel : 
Humanoid Robots (Humanoids), 2011 11th IEEE-RAS International Conference on
         
        
            Conference_Location : 
Bled
         
        
        
            Print_ISBN : 
978-1-61284-866-2
         
        
            Electronic_ISBN : 
2164-0572
         
        
        
            DOI : 
10.1109/Humanoids.2011.6100853