An actor-critic approach for learning cooperative behaviors of multiagent seesaw balancing problems

Author

Kawakami, Takashi ; Kinoshita, Masahiro ; Watanabe, Michiko ; Takatori, Norihiko ; Furukawa, Masashi

Author_Institution

Dept. of Inf. Design, Hokkaido Inst. of Technol., Sapporo, Japan

Volume

1

fYear

2005

fDate

10-12 Oct. 2005

Firstpage

109

Abstract

This paper proposes a new approach to realize a reinforcement learning scheme for autonomous multiple agents system. In our approach, we treat the cooperative agents systems in which there are multiple autonomous mobile robots, and the seesaw balancing task is given. This problem is an example of corresponding tasks to find the appropriate locations for multiple mobile robots. Each robot agent on a seesaw keeps being balanced state. As a most useful algorithm, the Q-learning method is well known. However, feasible action values of robot agents must be categorized into some discrete action values. Therefore, in this study, the actor-critic method is applied to treat continuous values of agents´ actions. Each robot agent has a set of normal distribution, that determines a distance of the robot movement for a corresponding state of the seesaw system. Based on a result of movement in this system, the normal distribution is modified by actor-critic learning method. The simulation result shows the effectiveness of our approaching method.

Keywords

learning (artificial intelligence); mobile robots; multi-agent systems; normal distribution; Q-learning; actor-critic approach; autonomous multiple agent; cooperative behavior learning; multiagent seesaw balancing problem; multiagent systems; multiple autonomous mobile robot; multiple mobile robot; reinforcement learning; robot agent; robot movement; Autonomous agents; Educational institutions; Environmental management; Gaussian distribution; Human robot interaction; Learning systems; Mobile robots; Multiagent systems; Positron emission tomography; Service robots; Actor-critic; cooperative behaviors; multiagent systems; reinforcement learning; seesaw balancing problems;

fLanguage

English

Publisher

ieee

Conference_Titel

Systems, Man and Cybernetics, 2005 IEEE International Conference on

Print_ISBN

0-7803-9298-1

Type

conf

DOI

10.1109/ICSMC.2005.1571130

Filename

1571130