DocumentCode :
467633
Title :
A New Learning Algorithm for Cooperative Agents in General-Sum Games
Author :
Song, Mei-Ping ; An, Ju-Bai ; Chen, Rong
Author_Institution :
Dalian Maritime Univ., Dalian
Volume :
1
fYear :
2007
fDate :
19-22 Aug. 2007
Firstpage :
50
Lastpage :
54
Abstract :
The development of multi-agent reinforcement learning in stochastic game has been slowed down in recent years. The main problem is that it is difficult to make the learning satisfy rationality and convergence at the same time. Here, the typical learning algorithms are analyzed firstly, and then a new method called Pareto-Q is prompted with the concept of Pareto optimum, which is rational. At the same time, social conventions are also introduced to promise the convergence of learning. At the last, experiments are presented to prove the good learning result of this algorithm.
Keywords :
Pareto optimisation; convergence; cooperative systems; learning (artificial intelligence); stochastic games; Pareto optimum; Pareto-Q method; cooperative agents; general-sum games; learning algorithm; learning convergence; multiagent reinforcement learning; stochastic game; Algorithm design and analysis; Computer science; Convergence; Cybernetics; Educational institutions; Machine learning; Machine learning algorithms; Multiagent systems; Pareto analysis; Stochastic processes; MAS; Pareto optimum; Reinforcement learning; Social conventions;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-0973-0
Electronic_ISBN :
978-1-4244-0973-0
Type :
conf
DOI :
10.1109/ICMLC.2007.4370114
Filename :
4370114
Link To Document :
بازگشت