DocumentCode
2709419
Title
Adaptive dynamic programming-based optimal control of unknown affine nonlinear discrete-time systems
Author
Dierks, Travis ; Thumati, Balaje T. ; Jagannathan, S.
Author_Institution
Dept. of Electr. & Comput. Eng., Missouri Univ. of Sci. & Technol., Rolla, MO, USA
fYear
2009
fDate
14-19 June 2009
Firstpage
711
Lastpage
716
Abstract
Discrete time approximate dynamic programming (ADP) techniques have been widely used in the recent literature to determine the optimal or near optimal control policies for nonlinear systems. However, an inherent assumption of ADP requires at least partial knowledge of the system dynamics as well as the value of the controlled plant one step ahead. In this work, a novel approach to ADP is attempted while relaxing the need of the partial knowledge of the nonlinear system. The proposed methodology entails a two part process: online system identification and offline optimal control training. First, in the identification process, a neural network (NN) is tuned online to learn the complete plant dynamics and local asymptotic stability is shown under a mild assumption that the NN functional reconstruction errors lie within a small-gain type norm bounded conic sector. Then, using only the NN system model, offline ADP is attempted resulting in a novel optimal control law. The proposed scheme does not require explicit knowledge of the system dynamics as only the learned NN model is needed. Proof of convergence is demonstrated. Simulation results verify theoretical conjecture.
Keywords
affine transforms; asymptotic stability; discrete time systems; dynamic programming; neurocontrollers; nonlinear control systems; optimal control; NN functional reconstruction errors; adaptive dynamic programming; affine nonlinear discrete-time systems; asymptotic stability; discrete time approximate dynamic programming techniques; identification process; neural network; nonlinear systems; offline optimal control training; online system identification; plant dynamics; Adaptive control; Asymptotic stability; Control systems; Dynamic programming; Neural networks; Nonlinear dynamical systems; Nonlinear systems; Optimal control; Programmable control; System identification; Nonlinear optimal control; heuristic dynamic programming; neural network; system identification;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2009. IJCNN 2009. International Joint Conference on
Conference_Location
Atlanta, GA
ISSN
1098-7576
Print_ISBN
978-1-4244-3548-7
Electronic_ISBN
1098-7576
Type
conf
DOI
10.1109/IJCNN.2009.5178776
Filename
5178776
Link To Document