مرکز منطقه ای اطلاع رساني علوم و فناوري - Swarm reinforcement learning methods for problems with continuous state-action space

DocumentCode :

2382301

Title :

Swarm reinforcement learning methods for problems with continuous state-action space

Author :

Iima, Hitoshi ; Kuroe, Yasuaki ; Emoto, Kazuo

Author_Institution :

Dept. of Inf. Sci., Kyoto Inst. of Technol., Kyoto, Japan

fYear :

2011

fDate :

9-12 Oct. 2011

Firstpage :

2173

Lastpage :

2180

Abstract :

We recently proposed swarm reinforcement learning methods in which multiple sets of an agent and an environment are prepared and the agents learn not only by individually performing a usual reinforcement learning method but also by exchanging information among them. Q-learning method has been used as the individual learning in the methods, and they have been applied to a problem with discrete state-action space. In the real world, however, there are many problems which are formulated as ones with continuous state-action space. This paper proposes swarm reinforcement learning methods based on an actor-critic method in order to acquire optimal policies rapidly for problems with continuous state-action space. The proposed methods are applied to a biped robot control problem, and their performance is examined through numerical experiments.

Keywords :

learning systems; legged locomotion; particle swarm optimisation; Q-learning method; actor-critic method; biped robot control problem; continuous state-action space; discrete state-action space; swarm reinforcement learning methods; Equations; Function approximation; Joints; Learning; Learning systems; Particle swarm optimization; Vectors; particle swarm optimization; reinforcement learning; swarm intelligence;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Systems, Man, and Cybernetics (SMC), 2011 IEEE International Conference on

Conference_Location :

Anchorage, AK

ISSN :

1062-922X

Print_ISBN :

978-1-4577-0652-3

Type :

conf

DOI :

10.1109/ICSMC.2011.6083999

Filename :

6083999

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2382301