Title :
Cooperative multi-agent reinforcement learning based on online heuristic extraction
Author :
Jun Wu ; Xin Xu ; Zhen-ping Sun ; Yan Huang
Author_Institution :
Coll. of Mechatron. & Autom., Nat. Univ. of Defense Technol., Changsha, China
Abstract :
Reinforcement learning has been an important technique for adaptive decision-making of multi-agent systems in uncertain environments. However, the curse of dimensionality in multi-agent reinforcement learning usually causes the slow learning convergence or even failure. In this paper, a novel Online Heuristics Extraction method, which can integrate the prior heuristic policy with a learned heuristic policy, is presented. The new method can be incorporated into a tabular or approximate cooperative multi-agent reinforcement learning algorithm so as to speed up the learning process. Simulation results on a cooperative learning task show that, with the new method, a much better learning convergence can be achieved.
Keywords :
iterative methods; learning (artificial intelligence); multi-agent systems; adaptive decision making; cooperative multiagent reinforcement learning algorithm; heuristic policy; online heuristic extraction method; Approximation algorithms; Convergence; Heuristic algorithms; Joints; Learning; Learning systems; Robots; cooperative; heuristic policy; multi-agent; policy iteration; reinforcement learning;
Conference_Titel :
Natural Computation (ICNC), 2011 Seventh International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-9950-2
DOI :
10.1109/ICNC.2011.6022301