DocumentCode :
741611
Title :
Model-Free Dual Heuristic Dynamic Programming
Author :
Zhen Ni ; Haibo He ; Xiangnan Zhong ; Prokhorov, Danil V.
Author_Institution :
Dept. of Electr., Univ. of Rhode Island, Kingston, RI, USA
Volume :
26
Issue :
8
fYear :
2015
Firstpage :
1834
Lastpage :
1839
Abstract :
Model-based dual heuristic dynamic programming (MB-DHP) is a popular approach in approximating optimal solutions in control problems. Yet, it usually requires offline training for the model network, and thus resulting in extra computational cost. In this brief, we propose a model-free DHP (MF-DHP) design based on finite-difference technique. In particular, we adopt multilayer perceptron with one hidden layer for both the action and the critic networks design, and use delayed objective functions to train both the action and the critic networks online over time. We test both the MF-DHP and MB-DHP approaches with a discrete time example and a continuous time example under the same parameter settings. Our simulation results demonstrate that the MF-DHP approach can obtain a control performance competitive with that of the traditional MB-DHP approach while requiring less computational resources.
Keywords :
dynamic programming; finite difference methods; heuristic programming; multilayer perceptrons; MB-DHP approach; MF-DHP approach; MF-DHP design; finite-difference technique; model network; model-based dual heuristic dynamic programming; model-free DHP design; model-free dual heuristic dynamic programming; multilayer perceptron; Approximation methods; Computational modeling; Convergence; Dynamic programming; Learning systems; Linear programming; Mathematical model; Action-dependent dual heuristic dynamic programming (DHP); adaptive critic designs (ACDs); adaptive dynamic programming (ADP); online learning; reinforcement learning; reinforcement learning.;
fLanguage :
English
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
2162-237X
Type :
jour
DOI :
10.1109/TNNLS.2015.2424971
Filename :
7101871
Link To Document :
بازگشت