DocumentCode :
2382301
Title :
Swarm reinforcement learning methods for problems with continuous state-action space
Author :
Iima, Hitoshi ; Kuroe, Yasuaki ; Emoto, Kazuo
Author_Institution :
Dept. of Inf. Sci., Kyoto Inst. of Technol., Kyoto, Japan
fYear :
2011
fDate :
9-12 Oct. 2011
Firstpage :
2173
Lastpage :
2180
Abstract :
We recently proposed swarm reinforcement learning methods in which multiple sets of an agent and an environment are prepared and the agents learn not only by individually performing a usual reinforcement learning method but also by exchanging information among them. Q-learning method has been used as the individual learning in the methods, and they have been applied to a problem with discrete state-action space. In the real world, however, there are many problems which are formulated as ones with continuous state-action space. This paper proposes swarm reinforcement learning methods based on an actor-critic method in order to acquire optimal policies rapidly for problems with continuous state-action space. The proposed methods are applied to a biped robot control problem, and their performance is examined through numerical experiments.
Keywords :
learning systems; legged locomotion; particle swarm optimisation; Q-learning method; actor-critic method; biped robot control problem; continuous state-action space; discrete state-action space; swarm reinforcement learning methods; Equations; Function approximation; Joints; Learning; Learning systems; Particle swarm optimization; Vectors; particle swarm optimization; reinforcement learning; swarm intelligence;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man, and Cybernetics (SMC), 2011 IEEE International Conference on
Conference_Location :
Anchorage, AK
ISSN :
1062-922X
Print_ISBN :
978-1-4577-0652-3
Type :
conf
DOI :
10.1109/ICSMC.2011.6083999
Filename :
6083999
Link To Document :
بازگشت