مرکز منطقه ای اطلاع رساني علوم و فناوري - Online reinforcement learning control of unknown nonaffine nonlinear discrete time systems

DocumentCode :

2830872

Title :

Online reinforcement learning control of unknown nonaffine nonlinear discrete time systems

Author :

Yang, Qinmin ; Jagannathan, S.

Author_Institution :

Univ. of Missouri-Rolla, Rolla

fYear :

2007

fDate :

12-14 Dec. 2007

Firstpage :

5942

Lastpage :

5947

Abstract :

In this paper, a novel neural network (NN) based online reinforcement learning controller is designed for nonaffine nonlinear discrete-time systems with bounded disturbances. The nonaffine systems are represented by nonlinear auto regressive moving average with exogenous input (NARMAX) model with unknown nonlinear functions. An equivalent affine-like representation for the tracking error dynamics is developed first from the original nonaffine system. Subsequently, a reinforcement learning-based neural network (NN) controller is proposed for the affine-like nonlinear error dynamic system. The control scheme consists of two NNs. One NN is designated as the critic, which approximates a predefined long-term cost function, whereas an action NN is employed to derive a control signal for the system to track a desired trajectory while minimizing the cost function simultaneously. Offline NN training is not required and online NN weight tuning rules are derived. By using the standard Lyapunov approach, the uniformly ultimate boundedness (UUB) of the tracking error and weight estimates is demonstrated.

Keywords :

Lyapunov methods; autoregressive moving average processes; control system synthesis; discrete time systems; learning systems; neurocontrollers; nonlinear control systems; performance index; Lyapunov approach; NARMAX model; affine-like representation; bounded disturbance; control signal; controller design; cost function; error dynamics tracking; neural network; nonaffine nonlinear discrete time systems; nonlinear autoregressive moving average with exogenous input model; nonlinear function; online reinforcement learning control; trajectory tracking; uniformly ultimate boundedness; weight estimate; Control systems; Cost function; Discrete time systems; Error correction; Learning; Neural networks; Nonlinear control systems; Nonlinear dynamical systems; Signal design; Trajectory;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control, 2007 46th IEEE Conference on

Conference_Location :

New Orleans, LA

ISSN :

0191-2216

Print_ISBN :

978-1-4244-1497-0

Electronic_ISBN :

0191-2216

Type :

conf

DOI :

10.1109/CDC.2007.4434959

Filename :

4434959

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2830872