DocumentCode :
3653582
Title :
RLS Algorithms and Convergence Analysis Method for Online DLQR Control Design via Heuristic Dynamic Programming
Author :
Watson R.M. Santos;Jonathan A. Queiroz;João Viana da F. ;Patrícia H. M. Rêgo;Ewaldo Santana;Gustavo Andrade
Author_Institution :
Embedded Syst. &
fYear :
2014
fDate :
3/1/2014 12:00:00 AM
Firstpage :
77
Lastpage :
83
Abstract :
In this paper, a method to design online optimal policies that encompasses Hamilton-Jacobi-Bellman (HJB) equation solution approximation and heuristic dynamic programming (HDP) approach is proposed. Recursive least squares (RLS) algorithms are developed to approximate the HJB equation solution that is supported by a sequence of greedy policies. The proposal investigates the convergence properties of a family of RLS algorithms and its numerical complexity in the context of reinforcement learning and optimal control. The algorithms are computationally evaluated in an electric circuit model that represents an MIMO dynamic system. The results presented herein emphasize the convergence behaviour of the RLS, projection and Kaczmarz algorithms that are developed for online applications.
Keywords :
"Equations","Mathematical model","Least squares approximations","Heuristic algorithms","Approximation algorithms","Convergence"
Publisher :
ieee
Conference_Titel :
Computer Modelling and Simulation (UKSim), 2014 UKSim-AMSS 16th International Conference on
Type :
conf
DOI :
10.1109/UKSim.2014.109
Filename :
7046042
Link To Document :
بازگشت