DocumentCode
2570263
Title
A model-free robust policy iteration algorithm for optimal control of nonlinear systems
Author
Bhasin, S. ; Johnson, M. ; Dixon, W.E.
Author_Institution
Dept. of Mech. & Aerosp. Eng., Univ. of Florida, Gainesville, FL, USA
fYear
2010
fDate
15-17 Dec. 2010
Firstpage
3060
Lastpage
3065
Abstract
An online model-free solution is developed for the infinite-horizon optimal control problem for continuous-time nonlinear systems. A novel actor-critic-identifier (ACI) structure is used to implement the Policy Iteration algorithm, wherein two neural network structures are used - a robust dynamic neural network (DNN) to asymptotically identify the uncertain system with additive disturbances, and a critic NN to approximate the value function. The weight update laws for the critic NN are generated using a gradient-descent method based on a modified temporal difference error, which is independent of the system dynamics. The optimal control law (or the actor) is computed using the critic NN and the identifier DNN. Uniformly ultimately bounded (UUB) stability of the closed-loop system is guaranteed. The actor, critic and identifier structures are implemented in real-time, continuously and simultaneously.
Keywords
closed loop systems; continuous time systems; gradient methods; neurocontrollers; nonlinear systems; optimal control; stability; uncertain systems; actor critic identifier structure; closed loop system; continuous time nonlinear systems; dynamic neural network; gradient descent method; infinite horizon optimal control problem; model free robust policy iteration algorithm; uncertain system; uniformly ultimately bounded stability; Approximation algorithms; Approximation methods; Artificial neural networks; Heuristic algorithms; Nonlinear systems; Optimal control; Robustness;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control (CDC), 2010 49th IEEE Conference on
Conference_Location
Atlanta, GA
ISSN
0743-1546
Print_ISBN
978-1-4244-7745-6
Type
conf
DOI
10.1109/CDC.2010.5717295
Filename
5717295
Link To Document