DocumentCode
3181946
Title
A Multiagent Fuzzy Policy Reinforcement Learning Algorithm with Application to Leader-Follower Robotic Systems
Author
Yang, Erfu ; Gu, Dongbing
Author_Institution
Dept. of Comput. Sci., Essex Univ., Colchester
fYear
2006
fDate
9-15 Oct. 2006
Firstpage
3197
Lastpage
3202
Abstract
A multiagent reinforcement learning algorithm with fuzzy policy is addressed in this paper for dealing with the learning and control issues in cooperative multiagent systems with continuous states and actions, particularly for autonomous robotic formation systems. The parameters of fuzzy policy are finely tuned by the gradient multiagent reinforcement learning algorithm to improve the overall performance of an initial controller (policy). A leader-follower robotic system is chosen as a platform to benchmark the performance of the multiagent fuzzy policy reinforcement learning algorithm. Our simulation results demonstrate that the control performance can be improved in many aspects. This work also can be seen as a scaling up of currently popular multiagent reinforcement learning to the robotic domain with continuous state and action space as well as high dimensionality
Keywords
fuzzy control; gradient methods; learning (artificial intelligence); multi-agent systems; multi-robot systems; position control; autonomous robotic formation systems; cooperative multiagent systems; gradient multiagent reinforcement learning algorithm; leader-follower robotic systems; multiagent fuzzy policy reinforcement learning algorithm; Control systems; Convergence; Function approximation; Fuzzy control; Fuzzy systems; Intelligent robots; Learning; Nash equilibrium; Orbital robotics; Stochastic processes; Leader-Follower robotic systems; Policy gradient reinforcement learning; cooperative control; fuzzy reinforcement learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Robots and Systems, 2006 IEEE/RSJ International Conference on
Conference_Location
Beijing
Print_ISBN
1-4244-0258-1
Electronic_ISBN
1-4244-0259-X
Type
conf
DOI
10.1109/IROS.2006.282421
Filename
4058888
Link To Document