DocumentCode
2830872
Title
Online reinforcement learning control of unknown nonaffine nonlinear discrete time systems
Author
Yang, Qinmin ; Jagannathan, S.
Author_Institution
Univ. of Missouri-Rolla, Rolla
fYear
2007
fDate
12-14 Dec. 2007
Firstpage
5942
Lastpage
5947
Abstract
In this paper, a novel neural network (NN) based online reinforcement learning controller is designed for nonaffine nonlinear discrete-time systems with bounded disturbances. The nonaffine systems are represented by nonlinear auto regressive moving average with exogenous input (NARMAX) model with unknown nonlinear functions. An equivalent affine-like representation for the tracking error dynamics is developed first from the original nonaffine system. Subsequently, a reinforcement learning-based neural network (NN) controller is proposed for the affine-like nonlinear error dynamic system. The control scheme consists of two NNs. One NN is designated as the critic, which approximates a predefined long-term cost function, whereas an action NN is employed to derive a control signal for the system to track a desired trajectory while minimizing the cost function simultaneously. Offline NN training is not required and online NN weight tuning rules are derived. By using the standard Lyapunov approach, the uniformly ultimate boundedness (UUB) of the tracking error and weight estimates is demonstrated.
Keywords
Lyapunov methods; autoregressive moving average processes; control system synthesis; discrete time systems; learning systems; neurocontrollers; nonlinear control systems; performance index; Lyapunov approach; NARMAX model; affine-like representation; bounded disturbance; control signal; controller design; cost function; error dynamics tracking; neural network; nonaffine nonlinear discrete time systems; nonlinear autoregressive moving average with exogenous input model; nonlinear function; online reinforcement learning control; trajectory tracking; uniformly ultimate boundedness; weight estimate; Control systems; Cost function; Discrete time systems; Error correction; Learning; Neural networks; Nonlinear control systems; Nonlinear dynamical systems; Signal design; Trajectory;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control, 2007 46th IEEE Conference on
Conference_Location
New Orleans, LA
ISSN
0191-2216
Print_ISBN
978-1-4244-1497-0
Electronic_ISBN
0191-2216
Type
conf
DOI
10.1109/CDC.2007.4434959
Filename
4434959
Link To Document