• DocumentCode
    2121241
  • Title

    Continuous time Option algorithm of multi-agent systems

  • Author

    Zhang Xiaoyan ; Tang Hao ; Han Jianghong ; Zhou Lei

  • Author_Institution
    Sch. of Comput. & Inf., Hefei Univ. of Technol., Hefei, China
  • fYear
    2010
  • fDate
    29-31 July 2010
  • Firstpage
    1517
  • Lastpage
    1522
  • Abstract
    The traditional hierachical reinforcement learning methods is used to solve multi-Agent system, which based on discrete time multi-Agent semi-Markov decision process with discount criteria, which cannot apply to continuous time multi-Agent infinite tasks. Therefore, in this paper, we introduce a kind of continuous time multi-Agent hierarchical reinforcement learning model, and propose an Option algorithm that applies to average or discounts criteria. The algorithm is under the framework of continuous time multi-Agent semi-Markov decision process, and it introduces a method of macro action communication that between agents on the top, which can solve a wide class of continuing tasks of continuous time multi-Agent. Finally, this proposed hierarchical reinforcement learning optimization algorithm is tested in a multi-Agent robotic garbage collection system, and the experimental results show that it needs less memory, and has a better optimization performance and faster learning speed than a multi-Agent continuous time Option algorithm, which use joint stat and joint macro action on the top.
  • Keywords
    learning (artificial intelligence); multi-agent systems; continuous time multiagent infinite task; continuous time multiagent semi-Markov decision process; continuous time option algorithm; discount criteria; discrete time multiagent semi-Markov decision process; hierachical reinforcement learning; macro action communication; multiagent robotic garbage collection system; multiagent system; optimization algorithm; Electronic mail; Joints; Learning; Markov processes; Multiagent systems; Simulated annealing; Continuous Time Multi-Agent Semi-Markov Decision Process; Hierarchical Reinforcement Learning; Option;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control Conference (CCC), 2010 29th Chinese
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-6263-6
  • Type

    conf

  • Filename
    5573967