• DocumentCode
    2294549
  • Title

    A reinforcement learning scheme for a multi-agent card game

  • Author

    Fujita, Hajime ; Matsuno, Yoichiro ; Ishii, Shin

  • Author_Institution
    Nara Inst. of Sci. & Technol., Japan
  • Volume
    5
  • fYear
    2003
  • fDate
    5-8 Oct. 2003
  • Firstpage
    4071
  • Abstract
    We formulate an automatic strategy acquisition problem for the multi-agent card game "hearts" as a reinforcement learning (RL) problem. Since there are often a lot of unobservable cards in this game, RL is approximately dealt with in the framework of a partially observable Markov decision process (POMDP). This article presents a POMDP-RL method based on estimation of unobservable state variables and prediction of actions of the opponent agents. Simulation results show our model-based POMDP-RL method is applicable to a realistic multi-agent problem.
  • Keywords
    Markov processes; computer games; decision making; learning (artificial intelligence); multi-agent systems; action control; actor-critic algorithm; automatic strategy acquisition problem; hearts card game; multiagent card game; partially observable Markov decision process; reinforcement learning scheme; state variables; Accelerated aging; Clocks; Heart; Learning; State estimation; State-space methods; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2003. IEEE International Conference on
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-7952-7
  • Type

    conf

  • DOI
    10.1109/ICSMC.2003.1245625
  • Filename
    1245625