Title :
Constructing action set from basis functions for reinforcement learning of robot control
Author :
Yamaguchi, Akihiko ; Takamatsu, Jun ; Ogasawara, Tsukasa
Author_Institution :
Graduate School of Information Science, Nara Institute of Science and Technology, 8916-5, Takayama, Ikoma, 630-0192, JAPAN
Abstract :
Continuous action sets are used in many reinforcement learning (RL) applications in robot control since the control input is continuous. However, discrete action sets also have the advantages of ease of implementation and compatibility with some sophisticated RL methods, such as the Dyna [1]. However, one of the problem is the absence of general principles on designing a discrete action set for robot control in higher dimensional input space. In this paper, we propose to construct a discrete action set given a set of basis functions (BFs). We designed the action set so that the size of the set is proportional to the number of the BFs. This method can exploit the function approximator´s nature, that is, in practical RL applications, the number of BFs does not increase exponentially with the dimension of the state space (e.g. [2]). Thus, the size of the proposed action set does not increase exponentially with the dimension of the input space. We apply an RL with the proposed action set to a robot navigation task and a crawling and a jumping tasks. The simulation results demonstrate that the proposed action set has the advantages of improved learning speed, and better ability to acquire performance, compared to a conventional discrete action set.
Keywords :
Humanoid robots; Information science; Learning; Legged locomotion; Navigation; Orbital robotics; Robot control; Robotics and automation; Space technology; State-space methods; Reinforcement learning; crawling; discrete action set; jumping; motion learning;
Conference_Titel :
Robotics and Automation, 2009. ICRA '09. IEEE International Conference on
Conference_Location :
Kobe
Print_ISBN :
978-1-4244-2788-8
Electronic_ISBN :
1050-4729
DOI :
10.1109/ROBOT.2009.5152840