Title :
Robust asymptotic tracking of a class of nonlinear systems using an adaptive critic based controller
Author :
Bhasin, S. ; Sharma, Neelam ; Patre, P.M. ; Dixon, W.E.
Author_Institution :
Dept. of Mech. & Aerosp. Eng., Univ. of Florida, Gainesville, FL, USA
fDate :
June 30 2010-July 2 2010
Abstract :
Traditional Reinforcement Learning (RL) controllers are based on a discrete formulation of the Dynamic Programming (DP) problem, which impedes the development of rigorous stability analysis of continuous-time closed loop controllers for uncertain nonlinear systems. Non-DP based RL controllers typically yield a uniformly ultimately bounded (UUB) stability result due to the presence of disturbances and unknown approximation errors. In this paper a non-DP based reinforcement learning scheme is developed for asymptotic tracking of a class of uncertain nonlinear systems with bounded disturbances. A recently developed RISE (Robust Integral of the Sign of the Error) feedback technique is used in conjunction with a feedforward neural network (NN) based Actor-Critic architecture to yield a semi-global asymptotic result. A composite weight tuning law for the Action NN, consisting of both unsupervised and reinforcement learning terms, is developed based on Lyapunov stability analysis.
Keywords :
Lyapunov methods; adaptive control; approximation theory; closed loop systems; continuous time systems; dynamic programming; feedforward neural nets; learning (artificial intelligence); learning systems; neurocontrollers; nonlinear control systems; robust control; tracking; uncertain systems; unsupervised learning; Lyapunov stability analysis; RISE feedback technique; adaptive critic based controller; asymptotic tracking; bounded disturbances; continuous-time closed loop controllers; discrete formulation; dynamic programming problem; feedforward neural network based actor-critic architecture; nonDP based RL controllers; reinforcement learning controllers; robust asymptotic tracking; robust integral of the sign of the error; stability analysis; uncertain nonlinear systems; uniformly ultimately bounded stability; unknown approximation errors; unsupervised learning terms; Adaptive control; Adaptive systems; Control systems; Dynamic programming; Learning; Neural networks; Nonlinear control systems; Nonlinear systems; Programmable control; Robust control;
Conference_Titel :
American Control Conference (ACC), 2010
Conference_Location :
Baltimore, MD
Print_ISBN :
978-1-4244-7426-4
DOI :
10.1109/ACC.2010.5530744