• DocumentCode
    467633
  • Title

    A New Learning Algorithm for Cooperative Agents in General-Sum Games

  • Author

    Song, Mei-Ping ; An, Ju-Bai ; Chen, Rong

  • Author_Institution
    Dalian Maritime Univ., Dalian
  • Volume
    1
  • fYear
    2007
  • fDate
    19-22 Aug. 2007
  • Firstpage
    50
  • Lastpage
    54
  • Abstract
    The development of multi-agent reinforcement learning in stochastic game has been slowed down in recent years. The main problem is that it is difficult to make the learning satisfy rationality and convergence at the same time. Here, the typical learning algorithms are analyzed firstly, and then a new method called Pareto-Q is prompted with the concept of Pareto optimum, which is rational. At the same time, social conventions are also introduced to promise the convergence of learning. At the last, experiments are presented to prove the good learning result of this algorithm.
  • Keywords
    Pareto optimisation; convergence; cooperative systems; learning (artificial intelligence); stochastic games; Pareto optimum; Pareto-Q method; cooperative agents; general-sum games; learning algorithm; learning convergence; multiagent reinforcement learning; stochastic game; Algorithm design and analysis; Computer science; Convergence; Cybernetics; Educational institutions; Machine learning; Machine learning algorithms; Multiagent systems; Pareto analysis; Stochastic processes; MAS; Pareto optimum; Reinforcement learning; Social conventions;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2007 International Conference on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4244-0973-0
  • Electronic_ISBN
    978-1-4244-0973-0
  • Type

    conf

  • DOI
    10.1109/ICMLC.2007.4370114
  • Filename
    4370114