• DocumentCode
    3181946
  • Title

    A Multiagent Fuzzy Policy Reinforcement Learning Algorithm with Application to Leader-Follower Robotic Systems

  • Author

    Yang, Erfu ; Gu, Dongbing

  • Author_Institution
    Dept. of Comput. Sci., Essex Univ., Colchester
  • fYear
    2006
  • fDate
    9-15 Oct. 2006
  • Firstpage
    3197
  • Lastpage
    3202
  • Abstract
    A multiagent reinforcement learning algorithm with fuzzy policy is addressed in this paper for dealing with the learning and control issues in cooperative multiagent systems with continuous states and actions, particularly for autonomous robotic formation systems. The parameters of fuzzy policy are finely tuned by the gradient multiagent reinforcement learning algorithm to improve the overall performance of an initial controller (policy). A leader-follower robotic system is chosen as a platform to benchmark the performance of the multiagent fuzzy policy reinforcement learning algorithm. Our simulation results demonstrate that the control performance can be improved in many aspects. This work also can be seen as a scaling up of currently popular multiagent reinforcement learning to the robotic domain with continuous state and action space as well as high dimensionality
  • Keywords
    fuzzy control; gradient methods; learning (artificial intelligence); multi-agent systems; multi-robot systems; position control; autonomous robotic formation systems; cooperative multiagent systems; gradient multiagent reinforcement learning algorithm; leader-follower robotic systems; multiagent fuzzy policy reinforcement learning algorithm; Control systems; Convergence; Function approximation; Fuzzy control; Fuzzy systems; Intelligent robots; Learning; Nash equilibrium; Orbital robotics; Stochastic processes; Leader-Follower robotic systems; Policy gradient reinforcement learning; cooperative control; fuzzy reinforcement learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems, 2006 IEEE/RSJ International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    1-4244-0258-1
  • Electronic_ISBN
    1-4244-0259-X
  • Type

    conf

  • DOI
    10.1109/IROS.2006.282421
  • Filename
    4058888