Continuous time Option algorithm of multi-agent systems

Author

Zhang Xiaoyan ; Tang Hao ; Han Jianghong ; Zhou Lei

Author_Institution

Sch. of Comput. & Inf., Hefei Univ. of Technol., Hefei, China

fYear

2010

fDate

29-31 July 2010

Firstpage

1517

Lastpage

1522

Abstract

The traditional hierachical reinforcement learning methods is used to solve multi-Agent system, which based on discrete time multi-Agent semi-Markov decision process with discount criteria, which cannot apply to continuous time multi-Agent infinite tasks. Therefore, in this paper, we introduce a kind of continuous time multi-Agent hierarchical reinforcement learning model, and propose an Option algorithm that applies to average or discounts criteria. The algorithm is under the framework of continuous time multi-Agent semi-Markov decision process, and it introduces a method of macro action communication that between agents on the top, which can solve a wide class of continuing tasks of continuous time multi-Agent. Finally, this proposed hierarchical reinforcement learning optimization algorithm is tested in a multi-Agent robotic garbage collection system, and the experimental results show that it needs less memory, and has a better optimization performance and faster learning speed than a multi-Agent continuous time Option algorithm, which use joint stat and joint macro action on the top.

Keywords

learning (artificial intelligence); multi-agent systems; continuous time multiagent infinite task; continuous time multiagent semi-Markov decision process; continuous time option algorithm; discount criteria; discrete time multiagent semi-Markov decision process; hierachical reinforcement learning; macro action communication; multiagent robotic garbage collection system; multiagent system; optimization algorithm; Electronic mail; Joints; Learning; Markov processes; Multiagent systems; Simulated annealing; Continuous Time Multi-Agent Semi-Markov Decision Process; Hierarchical Reinforcement Learning; Option;

fLanguage

English

Publisher

ieee

Conference_Titel

Control Conference (CCC), 2010 29th Chinese

Conference_Location

Beijing

Print_ISBN

978-1-4244-6263-6

Type

conf

Filename

5573967