مرکز منطقه ای اطلاع رساني علوم و فناوري - Reinforcement Learning based Output-Feedback Control of Nonlinear Nonstrict Feedback Discrete-time Systems with Application to Engines

DocumentCode :

3176144

Title :

Reinforcement Learning based Output-Feedback Control of Nonlinear Nonstrict Feedback Discrete-time Systems with Application to Engines

Author :

Shih, Peter ; Vance, J. ; Jagannathan, S. ; Kaul, B. ; Drallmeier, James A.

Author_Institution :

Univ. of Missouri-Rolla, Rolla

fYear :

2007

fDate :

9-13 July 2007

Firstpage :

5106

Lastpage :

5111

Abstract :

A novel reinforcement-learning based output-adaptive neural network (NN) controller, also referred as the adaptive-critic NN controller, is developed to track a desired trajectory for a class of complex nonlinear discrete-time systems in the presence of bounded and unknown disturbances. The controller includes an observer for estimating states and the outputs, critic, and two action NNs for generating virtual, and actual control inputs. The critic approximates certain strategic utility function and the action NNs are used to minimize both the strategic utility function and their outputs. All NN weights adapt online towards minimization of a performance index, utilizing gradient-descent based rule. A Lyapunov function proves the uniformly ultimate boundedness (UUB) of the closed-loop tracking error, weight, and observer estimation. Separation principle and certainty equivalence principles are relaxed; persistency of excitation condition and linear in the unknown parameter assumption is not needed. The performance of this adaptive critic NN controller is evaluated through simulation with the Daw engine model in lean mode. The objective is to reduce the cyclic dispersion in heat release by using the controller.

Keywords :

Lyapunov methods; adaptive control; closed loop systems; discrete time systems; feedback; function approximation; gradient methods; internal combustion engines; large-scale systems; learning (artificial intelligence); learning systems; neurocontrollers; nonlinear control systems; observers; Daw engine model; Lyapunov function; adaptive neural network controller; adaptive-critic NN controller; certainty equivalence principle; closed-loop tracking error; complex nonlinear nonstrict feedback discrete-time system; gradient-descent based rule; heat release; internal combustion engine; observer state estimation; performance index; reinforcement learning based output-feedback control; separation principle; strategic utility function approximation; trajectory tracking; uniformly ultimate boundedness; Control systems; Engines; Learning; Neural networks; Neurofeedback; Nonlinear control systems; Observers; Output feedback; State estimation; Trajectory;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

American Control Conference, 2007. ACC '07

Conference_Location :

New York, NY

ISSN :

0743-1619

Print_ISBN :

1-4244-0988-8

Electronic_ISBN :

0743-1619

Type :

conf

DOI :

10.1109/ACC.2007.4283127

Filename :

4283127

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3176144