Title :
Swarm reinforcement learning methods for problems with continuous state-action space
Author :
Iima, Hitoshi ; Kuroe, Yasuaki ; Emoto, Kazuo
Author_Institution :
Dept. of Inf. Sci., Kyoto Inst. of Technol., Kyoto, Japan
Abstract :
We recently proposed swarm reinforcement learning methods in which multiple agent-environment pairs are prepared and the agents learn not only by individually performing a conventional reinforcement learning method but also by exchanging information with one another. In those methods, Q-learning has been used for the individual learning, and they have been applied to a problem with a discrete state-action space. In the real world, however, many problems are formulated with a continuous state-action space. This paper proposes swarm reinforcement learning methods based on an actor-critic method in order to acquire optimal policies rapidly for problems with a continuous state-action space. The proposed methods are applied to a biped robot control problem, and their performance is examined through numerical experiments.
Keywords :
learning systems; legged locomotion; particle swarm optimization; Q-learning method; actor-critic method; biped robot control problem; continuous state-action space; discrete state-action space; swarm reinforcement learning methods; Equations; Function approximation; Joints; Learning; Vectors; reinforcement learning; swarm intelligence
Conference_Title :
Systems, Man, and Cybernetics (SMC), 2011 IEEE International Conference on
Conference_Location :
Anchorage, AK
Print_ISBN :
978-1-4577-0652-3
DOI :
10.1109/ICSMC.2011.6083999