Title :
Learning strategy fusion to acquire dynamic motion
Author :
Yamaguchi, Akihiko ; Takamatsu, Jun ; Ogasawara, Tsukasa
Author_Institution :
Grad. Sch. of Inf. Sci. Nara, Inst. of Sci. & Technol., Ikoma, Japan
Abstract :
This paper proposes a method to fuse learning strategies (LSs) in a reinforcement learning framework. In this method, some LSs are integrated for learning a single task of a single robot. The LSs consists of (1) LS-scratch: learning a policy from scratch, (2) LS-accelerating: learning a policy from a previously learned policy by accelerating motion speed parameters, and (3) LS-freeing: learning a policy from a previously learned policy by increasing the DoF (degree of freedom). The proposed LS fusion method enables (A) in the early stage of learning, LS fusion can select a suitable DoF configuration from a predefined set of DoF configurations, and (B) after a behavior module that learns from scratch converges, the LSs are applied to improve the policy. As a result, a robot can learn a complex task by starting with a simplified configuration, and then transferring the learned behaviors while increasing the difficulty. We introduce WF-DCOB proposed by Yamaguchi et al. for the LSs. We verify the proposed LS fusion method with a crawling task of a humanoid robot. The simulation experiments demonstrate the advantage of the proposed method compared to learning with a single learning module.
Keywords :
learning (artificial intelligence); robots; DoF configurations; LS fusion method; acquire dynamic motion; degree of freedom; learning strategy fusion; motion speed parameter; reinforcement learning; Acceleration; Function approximation; Humanoid robots; Joints; Learning; Vectors;
Conference_Titel :
Humanoid Robots (Humanoids), 2011 11th IEEE-RAS International Conference on
Conference_Location :
Bled
Print_ISBN :
978-1-61284-866-2
Electronic_ISBN :
2164-0572
DOI :
10.1109/Humanoids.2011.6100853