Title :
Real-Time Adaptive Control of a Flexible Manipulator Using Reinforcement Learning
Author :
Pradhan, Santanu Kumar ; Subudhi, Bidyadhar
Author_Institution :
Dept. of Electr. Eng., Nat. Inst. of Technol., Rourkela, India
fDate :
4/1/2012 12:00:00 AM
Abstract :
This paper exploits reinforcement learning (RL) for developing real-time adaptive control of tip trajectory and deflection of a two-link flexible manipulator handling variable payloads. This proposed adaptive controller consists of a proportional derivative (PD) tracking loop and an actor-critic-based RL loop that adapts the actor and critic weights in response to payload variations while suppressing the tip deflection and tracking the desired trajectory. The actor-critic-based RL loop uses a recursive least square (RLS)-based temporal difference (TD) learning with eligibility trace and an adaptive memory to estimate the critic weights and a gradient-based estimator for estimating actor weights. Tip trajectory tracking and suppression of tip deflection performances of the proposed RL-based adaptive controller (RLAC) are compared with that of a nonlinear regression-based direct adaptive controller (DAC) and a fuzzy learning-based adaptive controller (FLAC). Simulation and experimental results envisage that the RLAC outperforms both the DAC and FLAC.
Keywords :
PD control; adaptive control; flexible manipulators; fuzzy control; gradient methods; learning systems; least squares approximations; nonlinear control systems; position control; recursive estimation; regression analysis; PD tracking loop; actor weights; actor-critic-based reinforcement learning loop; adaptive memory; complex flexible space shuttle system; critic weights; distributed flexibility; eligibility trace; flexible manipulator; fuzzy learning-based adaptive controller; gradient-based estimator; nonlinear regression-based direct adaptive controller; payload variations; proportional derivative tracking loop; real-time adaptive control; recursive least square-based temporal difference learning; tip deflection suppression; tip trajectory tracking; Adaptive control; Dynamics; Manipulators; Payloads; Real-time systems; Trajectory; Vectors; Adaptive control; flexible-link manipulator; reinforcement learning; tip trajectory tracking;
Journal_Title :
Automation Science and Engineering, IEEE Transactions on
DOI :
10.1109/TASE.2012.2189004