• DocumentCode
    1998070
  • Title

    Reinforcement learning methods for multi-linked manipulator obstacle avoidance and control

  • Author

    Tham, Chen K. ; Prager, Richard W.

  • Author_Institution
    Dept. of Eng., Cambridge Univ., UK
  • fYear
    1993
  • fDate
    15-16 Jul 1993
  • Firstpage
    140
  • Lastpage
    145
  • Abstract
    This paper treats the multi-linked manipulator obstacle avoidance and control task as the interaction between a learning agent and an unknown environment. The role of the agent is to generate actions that maximises the reward that it receives from the environment. We demonstrate how two learning algorithms common in reinforcement learning literature- adaptive heuristic critic (AHC) (Barto et al., 1983), and Q-learning (Watkins, 1989)-can be used to solve the task successfully in two different ways: 1) through the generation of position commands to a PD controller which produces torque commands to drive the manipulator, and 2) through the direct generation of torque commands, removing the need for a PD controller. During the process, the inverse kinematics problem for multi-linked manipulators is automatically solved. Fast function approximation is achieved through the use of an array of cerebellar model arithmetic computers (CMAC). The generation of both discrete and continuous actions are investigated and the performance of the algorithms in terms of learning rates, efficiency of solutions, and memory requirements are evaluated
  • Keywords
    adaptive systems; function approximation; kinematics; learning systems; neural nets; nonlinear control systems; path planning; CMAC; PD controller; Q-learning; adaptive heuristic critic; cerebellar model arithmetic computers; function approximation; inverse kinematics; multilinked manipulator; obstacle avoidance; position commands; reinforcement learning; torque commands; Adaptive control; Function approximation; Kinematics; Machine learning; Manipulator dynamics; PD control; Programmable control; Robots; Stochastic processes; Torque control;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Motion Control Proceedings, 1993., Asia-Pacific Workshop on Advances in
  • Print_ISBN
    0-7803-1223-6
  • Type

    conf

  • DOI
    10.1109/APWAM.1993.316204
  • Filename
    316204