• DocumentCode
    2543723
  • Title

    Adaptive optimal controllers based on Generalized Policy Iteration in a continuous-time framework

  • Author

    Vrabie, Draguna ; Vamvoudakis, Kyriakos ; Lewis, Frank

  • Author_Institution
    Autom. & Robot. Res. Inst., Univ. of Texas at Arlington, Arlington, TX, USA
  • fYear
    2009
  • fDate
    24-26 June 2009
  • Firstpage
    1402
  • Lastpage
    1409
  • Abstract
    In this paper we present two adaptive algorithms which offer solution to the continuous-time optimal control problem for nonlinear, affine in the inputs, time-invariant systems. Both algorithms were developed based on the generalized policy iteration technique and involve adaptation of two neural network structures namely actor, providing the control signal, and critic, performing evaluation of the control performance. Despite the similarities, the two adaptive algorithms differ in the manner in which the adaptation takes place, required knowledge on the system dynamics, and formulation of the persistence of excitation requirement. The main difference is that one algorithm uses sequential adaptation of the actor and critic structures, i.e. while one is trained the other one is kept constant, while for the second algorithm the two neural networks are trained synchronously in a continuous-time fashion. The two algorithms are described in detail and proof of convergence is provided. Simulation results of applying the two algorithms for finding the optimal state feedback controller of a nonlinear system are also presented.
  • Keywords
    adaptive control; continuous time systems; neurocontrollers; nonlinear control systems; optimal control; adaptive actor algorithm; adaptive critic algorithm; adaptive optimal controller; continuous-time framework; generalized policy iteration; neural network; nonlinear system; time-invariant system; Adaptive algorithm; Adaptive control; Convergence; Neural networks; Nonlinear control systems; Nonlinear systems; Optimal control; Performance evaluation; Programmable control; State feedback; Generalized Policy Iteration; adaptive critics; adaptive optimal control; continuous-time; neural networks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control and Automation, 2009. MED '09. 17th Mediterranean Conference on
  • Conference_Location
    Thessaloniki
  • Print_ISBN
    978-1-4244-4684-1
  • Electronic_ISBN
    978-1-4244-4685-8
  • Type

    conf

  • DOI
    10.1109/MED.2009.5164743
  • Filename
    5164743