A Multiagent Fuzzy Policy Reinforcement Learning Algorithm with Application to Leader-Follower Robotic Systems

Author

Yang, Erfu ; Gu, Dongbing

Author_Institution

Dept. of Comput. Sci., Essex Univ., Colchester

fYear

2006

fDate

9-15 Oct. 2006

Firstpage

3197

Lastpage

3202

Abstract

A multiagent reinforcement learning algorithm with fuzzy policy is addressed in this paper for dealing with the learning and control issues in cooperative multiagent systems with continuous states and actions, particularly for autonomous robotic formation systems. The parameters of fuzzy policy are finely tuned by the gradient multiagent reinforcement learning algorithm to improve the overall performance of an initial controller (policy). A leader-follower robotic system is chosen as a platform to benchmark the performance of the multiagent fuzzy policy reinforcement learning algorithm. Our simulation results demonstrate that the control performance can be improved in many aspects. This work also can be seen as a scaling up of currently popular multiagent reinforcement learning to the robotic domain with continuous state and action space as well as high dimensionality

Keywords

fuzzy control; gradient methods; learning (artificial intelligence); multi-agent systems; multi-robot systems; position control; autonomous robotic formation systems; cooperative multiagent systems; gradient multiagent reinforcement learning algorithm; leader-follower robotic systems; multiagent fuzzy policy reinforcement learning algorithm; Control systems; Convergence; Function approximation; Fuzzy control; Fuzzy systems; Intelligent robots; Learning; Nash equilibrium; Orbital robotics; Stochastic processes; Leader-Follower robotic systems; Policy gradient reinforcement learning; cooperative control; fuzzy reinforcement learning;

fLanguage

English

Publisher

ieee

Conference_Titel

Intelligent Robots and Systems, 2006 IEEE/RSJ International Conference on

Conference_Location

Beijing

Print_ISBN

1-4244-0258-1

Electronic_ISBN

1-4244-0259-X

Type

conf

DOI

10.1109/IROS.2006.282421

Filename

4058888