• DocumentCode
    2863032
  • Title

    Self-organizing cognitive agents and reinforcement learning in multi-agent environment

  • Author

    Tan, Ah-Hwee ; Xiao, Dan

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore
  • fYear
    2005
  • fDate
    19-22 Sept. 2005
  • Firstpage
    351
  • Lastpage
    357
  • Abstract
    This paper presents a self-organizing cognitive architecture, known as TD-FALCON, that learns to function through its interaction with the environment. TD-FALCON learns the value functions of the state-action space estimated through a temporal difference (TD) method. The learned value functions are then used to determine the optimal actions based on an action selection policy. We present a specific instance of TD-FALCON based on an e-greedy action policy and a Q-learning value estimation formula. Experiments based on a minefield navigation task and a minefield pursuit task show that TD-FALCON systems are able to adapt and function well in a multi-agent environment without an explicit mechanism for collaboration.
  • Keywords
    cognition; learning (artificial intelligence); multi-agent systems; self-organising feature maps; Q-learning value estimation formula; TD-FALCON; e-greedy action policy; minefield navigation task; minefield pursuit task; multiagent environment; reinforcement learning; self-organizing cognitive agent; temporal difference method; Autonomous agents; Collaboration; Computer architecture; Navigation; Neurofeedback; Predictive models; Space technology; State estimation; Subspace constraints; Supervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Agent Technology, IEEE/WIC/ACM International Conference on
  • Print_ISBN
    0-7695-2416-8
  • Type

    conf

  • DOI
    10.1109/IAT.2005.125
  • Filename
    1565565