DocumentCode :
3653581
Title :
PNLMS-based Algorithm for Online Approximated Solution of HJB Equation in the Context of Discrete MIMO Optimal Control and Reinforcement Learning
Author :
Márcio Eduardo G. ;João Viana da Fonseca ;Francisco das Chagas de Souza
Author_Institution :
Dept. of Electr. Eng., Fed. Univ. of Maranhao, Sã
fYear :
2014
fDate :
3/1/2014 12:00:00 AM
Firstpage :
69
Lastpage :
76
Abstract :
In the context of discrete multivariable optimal linear quadratic regulator and reinforcement learning, this paper is attained to an investigation on PNLMS algorithm for online approximated solution of Hamilton-Jacobi-Bellman Equation. The PNLMS-based algorithms is designed to estimate the parameters of value function that is guided by Heuristic Dynamic Programming approach. The problem is characterized as approximation problem of cost value function that is computed without dynamic system model, but only by the measurements of states and action of interaction between the real world system and controller engine design. The proposed PNLMS-based algorithm is evaluated in a simplified mathematical model of the longitudinal dynamics of an aircraft that is a multivariable system. The performance of PNLMS-based algorithm is compared with those of RLS-based and LMS-based algorithms. These comparisons are performed in terms of convergence aspects and computational cost for online implementations of optimal design methodlogy.
Keywords :
"Mathematical model","Equations","Heuristic algorithms","Algorithm design and analysis","Least squares approximations","Aircraft","Atmospheric modeling"
Publisher :
ieee
Conference_Titel :
Computer Modelling and Simulation (UKSim), 2014 UKSim-AMSS 16th International Conference on
Type :
conf
DOI :
10.1109/UKSim.2014.108
Filename :
7046041
Link To Document :
بازگشت