A New Learning Algorithm for Cooperative Agents in General-Sum Games

Author

Song, Mei-Ping ; An, Ju-Bai ; Chen, Rong

Author_Institution

Dalian Maritime Univ., Dalian

Volume

1

fYear

2007

fDate

19-22 Aug. 2007

Firstpage

50

Lastpage

54

Abstract

The development of multi-agent reinforcement learning in stochastic game has been slowed down in recent years. The main problem is that it is difficult to make the learning satisfy rationality and convergence at the same time. Here, the typical learning algorithms are analyzed firstly, and then a new method called Pareto-Q is prompted with the concept of Pareto optimum, which is rational. At the same time, social conventions are also introduced to promise the convergence of learning. At the last, experiments are presented to prove the good learning result of this algorithm.

Keywords

Pareto optimisation; convergence; cooperative systems; learning (artificial intelligence); stochastic games; Pareto optimum; Pareto-Q method; cooperative agents; general-sum games; learning algorithm; learning convergence; multiagent reinforcement learning; stochastic game; Algorithm design and analysis; Computer science; Convergence; Cybernetics; Educational institutions; Machine learning; Machine learning algorithms; Multiagent systems; Pareto analysis; Stochastic processes; MAS; Pareto optimum; Reinforcement learning; Social conventions;

fLanguage

English

Publisher

ieee

Conference_Titel

Machine Learning and Cybernetics, 2007 International Conference on

Conference_Location

Hong Kong

Print_ISBN

978-1-4244-0973-0

Electronic_ISBN

978-1-4244-0973-0

Type

conf

DOI

10.1109/ICMLC.2007.4370114

Filename

4370114