• DocumentCode
    1756885
  • Title

    Adaptive Optimal Control of Unknown Constrained-Input Systems Using Policy Iteration and Neural Networks

  • Author

    Modares, Hamidreza ; Lewis, Frank L. ; Naghibi-Sistani, Mohammad-Bagher

  • Author_Institution
    Dept. of Electr. Eng., Ferdowsi Univ. of Mashhad, Mashhad, Iran
  • Volume
    24
  • Issue
    10
  • fYear
    2013
  • fDate
    Oct. 2013
  • Firstpage
    1513
  • Lastpage
    1525
  • Abstract
    This paper presents an online policy iteration (PI) algorithm to learn the continuous-time optimal control solution for unknown constrained-input systems. The proposed PI algorithm is implemented on an actor-critic structure where two neural networks (NNs) are tuned online and simultaneously to generate the optimal bounded control policy. The requirement of complete knowledge of the system dynamics is obviated by employing a novel NN identifier in conjunction with the actor and critic NNs. It is shown how the identifier weights estimation error affects the convergence of the critic NN. A novel learning rule is developed to guarantee that the identifier weights converge to small neighborhoods of their ideal values exponentially fast. To provide an easy-to-check persistence of excitation condition, the experience replay technique is used. That is, recorded past experiences are used simultaneously with current data for the adaptation of the identifier weights. Stability of the whole system consisting of the actor, critic, system state, and system identifier is guaranteed while all three networks undergo adaptation. Convergence to a near-optimal control law is also shown. The effectiveness of the proposed method is illustrated with a simulation example.
  • Keywords
    adaptive control; continuous time systems; iterative methods; learning (artificial intelligence); neurocontrollers; optimal control; stability; NN identifier; PI algorithm; actor-critic structure; adaptive optimal control; continuous-time optimal control; estimation error; experience replay technique; learning rule; near-optimal control law; neural networks; online policy iteration algorithm; optimal bounded control policy; system identifier; system stability; system state; unknown constrained-input systems; Approximation methods; Artificial neural networks; Convergence; Equations; Heuristic algorithms; Mathematical model; Optimal control; Input constraints; neural networks; optimal control; reinforcement learning; unknown dynamics;
  • fLanguage
    English
  • Journal_Title
    Neural Networks and Learning Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2162-237X
  • Type

    jour

  • DOI
    10.1109/TNNLS.2013.2276571
  • Filename
    6583978