• DocumentCode
    26116
  • Title

    A Novel Iterative \\theta -Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems

  • Author

    Qinglai Wei ; Derong Liu

  • Author_Institution
    State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China
  • Volume
    11
  • Issue
    4
  • fYear
    2014
  • fDate
    Oct. 2014
  • Firstpage
    1176
  • Lastpage
    1190
  • Abstract
    This paper is concerned with a new iterative θ-adaptive dynamic programming (ADP) technique to solve optimal control problems of infinite horizon discrete-time nonlinear systems. The idea is to use an iterative ADP algorithm to obtain the iterative control law which optimizes the iterative performance index function. In the present iterative θ-ADP algorithm, the condition of initial admissible control in policy iteration algorithm is avoided. It is proved that all the iterative controls obtained in the iterative θ-ADP algorithm can stabilize the nonlinear system which means that the iterative θ-ADP algorithm is feasible for implementations both online and offline. Convergence analysis of the performance index function is presented to guarantee that the iterative performance index function will converge to the optimum monotonically. Neural networks are used to approximate the performance index function and compute the optimal control policy, respectively, for facilitating the implementation of the iterative θ-ADP algorithm. Finally, two simulation examples are given to illustrate the performance of the established method.
  • Keywords
    convergence; discrete time systems; dynamic programming; infinite horizon; iterative methods; neurocontrollers; nonlinear control systems; optimal control; performance index; stability; convergence analysis; discrete-time nonlinear systems; infinite horizon discrete-time nonlinear systems; iterative control law; iterative performance index function; iterative-ADP algorithm; iterative-adaptive dynamic programming; neural networks; nonlinear system stabilization; optimal control problems; Dynamic programming; Learning (artificial intelligence); Neural networks; Nonlinear systems; Optimal control; Adaptive critic designs; adaptive dynamic programming; approximate dynamic programming; neural networks; neuro-dynamic programming; nonlinear systems; optimal control; policy iteration; reinforcement learning; value iteration;
  • fLanguage
    English
  • Journal_Title
    Automation Science and Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5955
  • Type

    jour

  • DOI
    10.1109/TASE.2013.2280974
  • Filename
    6609148