• DocumentCode
    2570263
  • Title

    A model-free robust policy iteration algorithm for optimal control of nonlinear systems

  • Author

    Bhasin, S. ; Johnson, M. ; Dixon, W.E.

  • Author_Institution
    Dept. of Mech. & Aerosp. Eng., Univ. of Florida, Gainesville, FL, USA
  • fYear
    2010
  • fDate
    15-17 Dec. 2010
  • Firstpage
    3060
  • Lastpage
    3065
  • Abstract
    An online model-free solution is developed for the infinite-horizon optimal control problem for continuous-time nonlinear systems. A novel actor-critic-identifier (ACI) structure is used to implement the Policy Iteration algorithm, wherein two neural network structures are used - a robust dynamic neural network (DNN) to asymptotically identify the uncertain system with additive disturbances, and a critic NN to approximate the value function. The weight update laws for the critic NN are generated using a gradient-descent method based on a modified temporal difference error, which is independent of the system dynamics. The optimal control law (or the actor) is computed using the critic NN and the identifier DNN. Uniformly ultimately bounded (UUB) stability of the closed-loop system is guaranteed. The actor, critic and identifier structures are implemented in real-time, continuously and simultaneously.
  • Keywords
    closed loop systems; continuous time systems; gradient methods; neurocontrollers; nonlinear systems; optimal control; stability; uncertain systems; actor critic identifier structure; closed loop system; continuous time nonlinear systems; dynamic neural network; gradient descent method; infinite horizon optimal control problem; model free robust policy iteration algorithm; uncertain system; uniformly ultimately bounded stability; Approximation algorithms; Approximation methods; Artificial neural networks; Heuristic algorithms; Nonlinear systems; Optimal control; Robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control (CDC), 2010 49th IEEE Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    0743-1546
  • Print_ISBN
    978-1-4244-7745-6
  • Type

    conf

  • DOI
    10.1109/CDC.2010.5717295
  • Filename
    5717295