Adaptive optimal controllers based on Generalized Policy Iteration in a continuous-time framework

Author

Vrabie, Draguna ; Vamvoudakis, Kyriakos ; Lewis, Frank

Author_Institution

Autom. & Robot. Res. Inst., Univ. of Texas at Arlington, Arlington, TX, USA

fYear

2009

fDate

24-26 June 2009

Firstpage

1402

Lastpage

1409

Abstract

In this paper we present two adaptive algorithms which offer solution to the continuous-time optimal control problem for nonlinear, affine in the inputs, time-invariant systems. Both algorithms were developed based on the generalized policy iteration technique and involve adaptation of two neural network structures namely actor, providing the control signal, and critic, performing evaluation of the control performance. Despite the similarities, the two adaptive algorithms differ in the manner in which the adaptation takes place, required knowledge on the system dynamics, and formulation of the persistence of excitation requirement. The main difference is that one algorithm uses sequential adaptation of the actor and critic structures, i.e. while one is trained the other one is kept constant, while for the second algorithm the two neural networks are trained synchronously in a continuous-time fashion. The two algorithms are described in detail and proof of convergence is provided. Simulation results of applying the two algorithms for finding the optimal state feedback controller of a nonlinear system are also presented.

Keywords

adaptive control; continuous time systems; neurocontrollers; nonlinear control systems; optimal control; adaptive actor algorithm; adaptive critic algorithm; adaptive optimal controller; continuous-time framework; generalized policy iteration; neural network; nonlinear system; time-invariant system; Adaptive algorithm; Adaptive control; Convergence; Neural networks; Nonlinear control systems; Nonlinear systems; Optimal control; Performance evaluation; Programmable control; State feedback; Generalized Policy Iteration; adaptive critics; adaptive optimal control; continuous-time; neural networks;

fLanguage

English

Publisher

ieee

Conference_Titel

Control and Automation, 2009. MED '09. 17th Mediterranean Conference on

Conference_Location

Thessaloniki

Print_ISBN

978-1-4244-4684-1

Electronic_ISBN

978-1-4244-4685-8

Type

conf

DOI

10.1109/MED.2009.5164743

Filename

5164743