DocumentCode
2419969
Title
Analyzing and enhancing direct NDP designs using a control-theoretic approach
Author
Yang, Lei ; Si, Jennie ; Tsakalis, S. Konstantinos S ; Rodriguez, Armando A.
Author_Institution
Dept. of Electr. Eng., Arizona State Univ., Tempe, AZ, USA
fYear
2003
fDate
8-8 Oct. 2003
Firstpage
529
Lastpage
532
Abstract
Direct NDP is in the family of approximate dynamic programming designs aiming at using learning and approximation methods to solve dynamic optimization problems formulated in dynamic programming, and to overcome the curse of dimensionality. Due to the statistical learning nature of the approaches, researchers usually make use of statistical measures to evaluate the design performance of the learning system such as the learning speed and the variation from one learning experience to the other. However, there are no systematic studies to date that address closed loop system performance from an input-output functional perspective. This paper analyzes direct NDP designs using classic control-theoretic sensitivity arguments. By using the benchmark cart-pole problem, it is shown that direct NDP uses an LQR with desired closed-loop properties as a learning guide, it is more likely for direct NDP to generate better designs than a direct NDP learning from scratch. Although the approach and results are illustrated using a simple nonlinear cart-pole system, it is clear that they are readily extended to more complex dynamical systems.
Keywords
closed loop systems; dynamic programming; learning systems; linear quadratic control; nonlinear systems; sensitivity analysis; LQR; approximation methods; benchmark cart-pole problem; closed loop system performance; complex dynamical systems; control-theoretic sensitivity arguments; direct NDP designs; dynamic optimization; learning methods; learning system; linear quadratic regulator; neuro dynamic programming; nonlinear cart-pole system;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Control. 2003 IEEE International Symposium on
Conference_Location
Houston, TX, USA
ISSN
2158-9860
Print_ISBN
0-7803-7891-1
Type
conf
DOI
10.1109/ISIC.2003.1254691
Filename
1254691
Link To Document