Online reinforcement learning control of unknown nonaffine nonlinear discrete time systems

Author

Yang, Qinmin ; Jagannathan, S.

Author_Institution

Univ. of Missouri-Rolla, Rolla

fYear

2007

fDate

12-14 Dec. 2007

Firstpage

5942

Lastpage

5947

Abstract

In this paper, a novel neural network (NN) based online reinforcement learning controller is designed for nonaffine nonlinear discrete-time systems with bounded disturbances. The nonaffine systems are represented by nonlinear auto regressive moving average with exogenous input (NARMAX) model with unknown nonlinear functions. An equivalent affine-like representation for the tracking error dynamics is developed first from the original nonaffine system. Subsequently, a reinforcement learning-based neural network (NN) controller is proposed for the affine-like nonlinear error dynamic system. The control scheme consists of two NNs. One NN is designated as the critic, which approximates a predefined long-term cost function, whereas an action NN is employed to derive a control signal for the system to track a desired trajectory while minimizing the cost function simultaneously. Offline NN training is not required and online NN weight tuning rules are derived. By using the standard Lyapunov approach, the uniformly ultimate boundedness (UUB) of the tracking error and weight estimates is demonstrated.

Keywords

Lyapunov methods; autoregressive moving average processes; control system synthesis; discrete time systems; learning systems; neurocontrollers; nonlinear control systems; performance index; Lyapunov approach; NARMAX model; affine-like representation; bounded disturbance; control signal; controller design; cost function; error dynamics tracking; neural network; nonaffine nonlinear discrete time systems; nonlinear autoregressive moving average with exogenous input model; nonlinear function; online reinforcement learning control; trajectory tracking; uniformly ultimate boundedness; weight estimate; Control systems; Cost function; Discrete time systems; Error correction; Learning; Neural networks; Nonlinear control systems; Nonlinear dynamical systems; Signal design; Trajectory;

fLanguage

English

Publisher

ieee

Conference_Titel

Decision and Control, 2007 46th IEEE Conference on

Conference_Location

New Orleans, LA

ISSN

0191-2216

Print_ISBN

978-1-4244-1497-0

Electronic_ISBN

0191-2216

Type

conf

DOI

10.1109/CDC.2007.4434959

Filename

4434959