• DocumentCode
    2856757
  • Title

    Finite-horizon input-constrained nonlinear optimal control using single network adaptive critics

  • Author

    Heydari, A. ; Balakrishnan, S.N.

  • Author_Institution
    Mech. & Aerosp. Eng. Dept., Missouri Univ. of Sci. & Technol., Rolla, MO, USA
  • fYear
    2011
  • fDate
    June 29 2011-July 1 2011
  • Firstpage
    3047
  • Lastpage
    3052
  • Abstract
    A single neural network based controller called the Finite-SNAC is developed in this study to synthesize finite-horizon optimal controllers for nonlinear control-affine systems. For satisfying the constraint on the input, a non-quadratic cost function is used. Inputs to the neural network are the current system states and the time-to-go and the network outputs are the costates which are used to compute the feedback control. Convergence of the reinforcement learning based training method to the optimal solution, the training error and the network weights are provided. The resulting controller is shown to solve the associated time-varying Hamilton-Jacobi-Bellman (HJB) equation and provide the fixed-final-time optimal solution. Performance of the new synthesis technique is demonstrated through an attitude control problem wherein a rigid spacecraft performs a finite time attitude maneuver subject to control bounds. The new formulation has a great potential for implementation since it consists of only one neural network with single set of weights and it provides comprehensive feedback solutions online though it is trained offline.
  • Keywords
    feedback; learning (artificial intelligence); neurocontrollers; nonlinear control systems; optimal control; time-varying systems; HJB equation; attitude control problem; feedback control; finite time attitude maneuver; finite-SNAC; finite-horizon input-constrained control; finite-horizon optimal controller; fixed-final-time optimal solution; neural network based controller; nonlinear control-affine system; nonlinear optimal control; nonquadratic cost function; reinforcement learning based training; single network adaptive critics; spacecraft; time-varying Hamilton-Jacobi-Bellman equation; Convergence; Cost function; Equations; Mathematical model; Optimal control; Satellites; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    American Control Conference (ACC), 2011
  • Conference_Location
    San Francisco, CA
  • ISSN
    0743-1619
  • Print_ISBN
    978-1-4577-0080-4
  • Type

    conf

  • DOI
    10.1109/ACC.2011.5991378
  • Filename
    5991378