Title :
A model-free robust policy iteration algorithm for optimal control of nonlinear systems
Author :
Bhasin, S. ; Johnson, M. ; Dixon, W.E.
Author_Institution :
Dept. of Mech. & Aerosp. Eng., Univ. of Florida, Gainesville, FL, USA
Abstract :
An online model-free solution is developed for the infinite-horizon optimal control problem for continuous-time nonlinear systems. A novel actor-critic-identifier (ACI) structure is used to implement the Policy Iteration algorithm, wherein two neural network structures are used - a robust dynamic neural network (DNN) to asymptotically identify the uncertain system with additive disturbances, and a critic NN to approximate the value function. The weight update laws for the critic NN are generated using a gradient-descent method based on a modified temporal difference error, which is independent of the system dynamics. The optimal control law (or the actor) is computed using the critic NN and the identifier DNN. Uniformly ultimately bounded (UUB) stability of the closed-loop system is guaranteed. The actor, critic and identifier structures are implemented in real-time, continuously and simultaneously.
Keywords :
closed loop systems; continuous time systems; gradient methods; neurocontrollers; nonlinear systems; optimal control; stability; uncertain systems; actor critic identifier structure; closed loop system; continuous time nonlinear systems; dynamic neural network; gradient descent method; infinite horizon optimal control problem; model free robust policy iteration algorithm; uncertain system; uniformly ultimately bounded stability; Approximation algorithms; Approximation methods; Artificial neural networks; Heuristic algorithms; Nonlinear systems; Optimal control; Robustness;
Conference_Titel :
Decision and Control (CDC), 2010 49th IEEE Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4244-7745-6
DOI :
10.1109/CDC.2010.5717295