• DocumentCode
    2592323
  • Title

    Simultaneous learning to acquire competitive behaviors in multi-agent system based on a modular learning system

  • Author

    Takahashi, Yasutake ; Edazawa, Kazuhiro ; Noma, Kentarou ; Asada, Minoru

  • Author_Institution
    Dept. of Adaptive Machine Syst., Osaka Univ., Japan
  • fYear
    2005
  • fDate
    2-6 Aug. 2005
  • Firstpage
    2016
  • Lastpage
    2022
  • Abstract
    Existing reinforcement learning approaches have been suffering from the policy alternation of others in multiagent dynamic environments. A typical example is the case of RoboCup competitions because other agent behaviors may cause sudden changes in state transition probabilities in which constancy is needed for the learning to converge. The keys for simultaneous learning to acquire competitive behaviors in such an environment are: a modular learning system for adaptation to the policy alternation of others; and an introduction of macro actions for simultaneous learning to reduce the search space. This paper presents a method of modular learning in a multiagent environment in which the learning agents can simultaneously learn their behaviors and adapt themselves to the situations as a consequence of the others´ behaviors.
  • Keywords
    learning (artificial intelligence); learning systems; multi-agent systems; multi-robot systems; RoboCup; competitive behaviors acquisition; learning agent; modular learning system; multiagent system; reinforcement learning; simultaneous learning; state transition probability; Adaptive systems; Electronic design automation and methodology; Jacobian matrices; Learning systems; Multiagent systems; Robots; RoboCup; competitive behaviors acquisition; modular learning system; multi-agent system; reinforcement learning; simultaneous learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems, 2005. (IROS 2005). 2005 IEEE/RSJ International Conference on
  • Print_ISBN
    0-7803-8912-3
  • Type

    conf

  • DOI
    10.1109/IROS.2005.1544974
  • Filename
    1544974