مرکز منطقه ای اطلاع رساني علوم و فناوري - PNLMS-based Algorithm for Online Approximated Solution of HJB Equation in the Context of Discrete MIMO Optimal Control and Reinforcement Learning

DocumentCode :

3653581

Title :

PNLMS-based Algorithm for Online Approximated Solution of HJB Equation in the Context of Discrete MIMO Optimal Control and Reinforcement Learning

Author :

Márcio Eduardo G. ;João Viana da Fonseca ;Francisco das Chagas de Souza

Author_Institution :

Dept. of Electr. Eng., Fed. Univ. of Maranhao, Sã

fYear :

2014

fDate :

3/1/2014 12:00:00 AM

Firstpage :

Lastpage :

Abstract :

In the context of discrete multivariable optimal linear quadratic regulator and reinforcement learning, this paper is attained to an investigation on PNLMS algorithm for online approximated solution of Hamilton-Jacobi-Bellman Equation. The PNLMS-based algorithms is designed to estimate the parameters of value function that is guided by Heuristic Dynamic Programming approach. The problem is characterized as approximation problem of cost value function that is computed without dynamic system model, but only by the measurements of states and action of interaction between the real world system and controller engine design. The proposed PNLMS-based algorithm is evaluated in a simplified mathematical model of the longitudinal dynamics of an aircraft that is a multivariable system. The performance of PNLMS-based algorithm is compared with those of RLS-based and LMS-based algorithms. These comparisons are performed in terms of convergence aspects and computational cost for online implementations of optimal design methodlogy.

Keywords :

"Mathematical model","Equations","Heuristic algorithms","Algorithm design and analysis","Least squares approximations","Aircraft","Atmospheric modeling"

Publisher :

ieee

Conference_Titel :

Computer Modelling and Simulation (UKSim), 2014 UKSim-AMSS 16th International Conference on

Type :

conf

DOI :

10.1109/UKSim.2014.108

Filename :

7046041

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3653581