DocumentCode
467633
Title
A New Learning Algorithm for Cooperative Agents in General-Sum Games
Author
Song, Mei-Ping ; An, Ju-Bai ; Chen, Rong
Author_Institution
Dalian Maritime Univ., Dalian
Volume
1
fYear
2007
fDate
19-22 Aug. 2007
Firstpage
50
Lastpage
54
Abstract
The development of multi-agent reinforcement learning in stochastic game has been slowed down in recent years. The main problem is that it is difficult to make the learning satisfy rationality and convergence at the same time. Here, the typical learning algorithms are analyzed firstly, and then a new method called Pareto-Q is prompted with the concept of Pareto optimum, which is rational. At the same time, social conventions are also introduced to promise the convergence of learning. At the last, experiments are presented to prove the good learning result of this algorithm.
Keywords
Pareto optimisation; convergence; cooperative systems; learning (artificial intelligence); stochastic games; Pareto optimum; Pareto-Q method; cooperative agents; general-sum games; learning algorithm; learning convergence; multiagent reinforcement learning; stochastic game; Algorithm design and analysis; Computer science; Convergence; Cybernetics; Educational institutions; Machine learning; Machine learning algorithms; Multiagent systems; Pareto analysis; Stochastic processes; MAS; Pareto optimum; Reinforcement learning; Social conventions;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location
Hong Kong
Print_ISBN
978-1-4244-0973-0
Electronic_ISBN
978-1-4244-0973-0
Type
conf
DOI
10.1109/ICMLC.2007.4370114
Filename
4370114
Link To Document