DocumentCode
2717241
Title
A Theoretical Analysis of Cooperative Behavior in Multi-agent Q-learning
Author
Waltman, Ludo ; Kaymak, Uzay
Author_Institution
Erasmus Sch. of Econ., Erasmus Univ. Rotterdam
fYear
2007
fDate
1-5 April 2007
Firstpage
84
Lastpage
91
Abstract
A number of experimental studies have investigated whether cooperative behavior may emerge in multi-agent Q-learning. In some studies cooperative behavior did emerge, in others it did not. This paper provides a theoretical analysis of this issue. The analysis focuses on multi-agent Q-learning in iterated prisoner's dilemmas. It is shown that under certain assumptions cooperative behavior may emerge when multi-agent Q-learning is applied in an iterated prisoner's dilemma. An important consequence of the analysis is that multi-agent Q-learning may result in non-Nash behavior. It is found experimentally that the theoretical results presented in this paper are quite robust to violations of the underlying assumptions.
Keywords
learning (artificial intelligence); multi-agent systems; cooperative behavior; multiagent Q-learning; theoretical analysis; Algorithm design and analysis; Dynamic programming; Environmental economics; Helium; Learning; Microeconomics; Nash equilibrium; Oligopoly; Performance analysis; Robustness
fLanguage
English
Publisher
IEEE
Conference_Titel
Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on
Conference_Location
Honolulu, HI
Print_ISBN
1-4244-0706-0
Type
conf
DOI
10.1109/ADPRL.2007.368173
Filename
4220818
Link To Document