• DocumentCode
    1755415
  • Title

    Finite-Horizon Control-Constrained Nonlinear Optimal Control Using Single Network Adaptive Critics

  • Author

    Heydari, Ali ; Balakrishnan, Sivasubramanya N.

  • Author_Institution
    Dept. of Mech. & Aerosp. Eng., Missouri Univ. of Sci. & Technol., Rolla, MO, USA
  • Volume
    24
  • Issue
    1
  • fYear
    2013
  • fDate
    Jan. 2013
  • Firstpage
    145
  • Lastpage
    157
  • Abstract
    To synthesize fixed-final-time control-constrained optimal controllers for discrete-time nonlinear control-affine systems, a single neural network (NN)-based controller called the Finite-horizon Single Network Adaptive Critic is developed in this paper. Inputs to the NN are the current system states and the time-to-go, and the network outputs are the costates that are used to compute optimal feedback control. Control constraints are handled through a nonquadratic cost function. Convergence proofs of: 1) the reinforcement learning-based training method to the optimal solution; 2) the training error; and 3) the network weights are provided. The resulting controller is shown to solve the associated time-varying Hamilton-Jacobi-Bellman equation and provide the fixed-final-time optimal solution. Performance of the new synthesis technique is demonstrated through different examples including an attitude control problem wherein a rigid spacecraft performs a finite-time attitude maneuver subject to control bounds. The new formulation has great potential for implementation since it consists of only one NN with single set of weights and it provides comprehensive feedback solutions online, though it is trained offline.
  • Keywords
    Jacobian matrices; adaptive control; discrete time systems; feedback; learning (artificial intelligence); neurocontrollers; nonlinear control systems; optimal control; Hamilton-Jacobi-Bellman equation; NN; discrete time nonlinear control affine systems; finite horizon control constrained nonlinear optimal control; neural network; neural network based controller; nonquadratic cost function; optimal feedback control; reinforcement learning based training method; single network adaptive critics; Artificial neural networks; Convergence; Equations; Mathematical model; Optimal control; Training; Vectors; Approximate dynamic programming; finite-horizon optimal control; fixed-final-time optimal control; input-constraint; neural networks;
  • fLanguage
    English
  • Journal_Title
    Neural Networks and Learning Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2162-237X
  • Type

    jour

  • DOI
    10.1109/TNNLS.2012.2227339
  • Filename
    6377305