DocumentCode
447258
Title
An actor-critic approach for learning cooperative behaviors of multiagent seesaw balancing problems
Author
Kawakami, Takashi ; Kinoshita, Masahiro ; Watanabe, Michiko ; Takatori, Norihiko ; Furukawa, Masashi
Author_Institution
Dept. of Inf. Design, Hokkaido Inst. of Technol., Sapporo, Japan
Volume
1
fYear
2005
fDate
10-12 Oct. 2005
Firstpage
109
Abstract
This paper proposes a new approach to realize a reinforcement learning scheme for autonomous multiple agents system. In our approach, we treat the cooperative agents systems in which there are multiple autonomous mobile robots, and the seesaw balancing task is given. This problem is an example of corresponding tasks to find the appropriate locations for multiple mobile robots. Each robot agent on a seesaw keeps being balanced state. As a most useful algorithm, the Q-learning method is well known. However, feasible action values of robot agents must be categorized into some discrete action values. Therefore, in this study, the actor-critic method is applied to treat continuous values of agents´ actions. Each robot agent has a set of normal distribution, that determines a distance of the robot movement for a corresponding state of the seesaw system. Based on a result of movement in this system, the normal distribution is modified by actor-critic learning method. The simulation result shows the effectiveness of our approaching method.
Keywords
learning (artificial intelligence); mobile robots; multi-agent systems; normal distribution; Q-learning; actor-critic approach; autonomous multiple agent; cooperative behavior learning; multiagent seesaw balancing problem; multiagent systems; multiple autonomous mobile robot; multiple mobile robot; reinforcement learning; robot agent; robot movement; Autonomous agents; Educational institutions; Environmental management; Gaussian distribution; Human robot interaction; Learning systems; Mobile robots; Multiagent systems; Positron emission tomography; Service robots; Actor-critic; cooperative behaviors; multiagent systems; reinforcement learning; seesaw balancing problems;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man and Cybernetics, 2005 IEEE International Conference on
Print_ISBN
0-7803-9298-1
Type
conf
DOI
10.1109/ICSMC.2005.1571130
Filename
1571130
Link To Document