DocumentCode :
2717241
Title :
A Theoretical Analysis of Cooperative Behavior in Multi-agent Q-learning
Author :
Waltman, Ludo ; Kaymak, Uzay
Author_Institution :
Erasmus Sch. of Econ., Erasmus Univ. Rotterdam
fYear :
2007
fDate :
1-5 April 2007
Firstpage :
84
Lastpage :
91
Abstract :
A number of experimental studies have investigated whether cooperative behavior may emerge in multi-agent Q-learning. In some studies cooperative behavior did emerge, in others it did not. This paper provides a theoretical analysis of this issue. The analysis focuses on multi-agent Q-learning in iterated prisoner's dilemmas. It is shown that under certain assumptions cooperative behavior may emerge when multi-agent Q-learning is applied in an iterated prisoner's dilemma. An important consequence of the analysis is that multi-agent Q-learning may result in non-Nash behavior. It is found experimentally that the theoretical results presented in this paper are quite robust to violations of the underlying assumptions.
Keywords :
learning (artificial intelligence); multi-agent systems; cooperative behavior; multiagent Q-learning; theoretical analysis; Algorithm design and analysis; Dynamic programming; Environmental economics; Helium; Learning; Microeconomics; Nash equilibrium; Oligopoly; Performance analysis; Robustness;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0706-0
Type :
conf
DOI :
10.1109/ADPRL.2007.368173
Filename :
4220818