• DocumentCode
    2717241
  • Title

    A Theoretical Analysis of Cooperative Behavior in Multi-agent Q-learning

  • Author

    Waltman, Ludo ; Kaymak, Uzay

  • Author_Institution
    Erasmus Sch. of Econ., Erasmus Univ. Rotterdam
  • fYear
    2007
  • fDate
    1-5 April 2007
  • Firstpage
    84
  • Lastpage
    91
  • Abstract
    A number of experimental studies have investigated whether cooperative behavior may emerge in multi-agent Q-learning. In some studies cooperative behavior did emerge, in others it did not. This paper provides a theoretical analysis of this issue. The analysis focuses on multi-agent Q-learning in iterated prisoner´s dilemmas. It is shown that under certain assumptions cooperative behavior may emerge when multi-agent Q-learning is applied in an iterated prisoner´s dilemma. An important consequence of the analysis is that multi-agent Q-learning may result in non-Nash behavior. It is found experimentally that the theoretical results presented in this paper are quite robust to violations of the underlying assumptions
  • Keywords
    learning (artificial intelligence); multi-agent systems; cooperative behavior; multiagent Q-learning; theoretical analysis; Algorithm design and analysis; Dynamic programming; Environmental economics; Helium; Learning; Microeconomics; Nash equilibrium; Oligopoly; Performance analysis; Robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    1-4244-0706-0
  • Type

    conf

  • DOI
    10.1109/ADPRL.2007.368173
  • Filename
    4220818