Title :
Neural-network-based optimal control for discrete-time nonlinear systems using general value iteration
Author :
Hongliang, Li ; Derong, Liu ; Ding, Andwang
Author_Institution :
State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China
Abstract :
In this paper, we propose a novel adaptive dynamic programming (ADP) scheme based on general value iteration to obtain near optimal control for discrete-time nonlinear systems with continuous state and control space. First, the selection of initial value function is different from the traditional value iteration, and a new method is introduced to demonstrate the convergence property and convergence speed of the value function. Then, the control law obtained at each iteration can stabilize the system under some conditions. At last, three neural networks with Levenberg-Marquardt training algorithm are used to approximate the unknown nonlinear system, the value function and the optimal control law. One simulation example is presented to demonstrate the effectiveness of the present scheme.
Keywords :
adaptive control; approximation theory; convergence of numerical methods; discrete time systems; dynamic programming; initial value problems; iterative methods; neurocontrollers; nonlinear control systems; optimal control; stability; ADP scheme; Levenberg-Marquardt training algorithm; adaptive dynamic programming scheme; continuous control space; continuous state space; convergence property; convergence speed; discrete-time nonlinear systems; general value iteration; initial value function selection; near optimal control; neural-network-based optimal control; Approximation algorithms; Approximation methods; Artificial neural networks; Convergence; Equations; Nonlinear systems; Optimal control; Adaptive dynamic programming; approximate dynamic programming; neural networks; optimal control; reinforcement learning; value iteration;
Conference_Titel :
Control Conference (CCC), 2012 31st Chinese
Conference_Location :
Hefei
Print_ISBN :
978-1-4673-2581-3