DocumentCode :
2830872
Title :
Online reinforcement learning control of unknown nonaffine nonlinear discrete time systems
Author :
Yang, Qinmin ; Jagannathan, S.
Author_Institution :
Univ. of Missouri-Rolla, Rolla
fYear :
2007
fDate :
12-14 Dec. 2007
Firstpage :
5942
Lastpage :
5947
Abstract :
In this paper, a novel neural network (NN) based online reinforcement learning controller is designed for nonaffine nonlinear discrete-time systems with bounded disturbances. The nonaffine systems are represented by nonlinear auto regressive moving average with exogenous input (NARMAX) model with unknown nonlinear functions. An equivalent affine-like representation for the tracking error dynamics is developed first from the original nonaffine system. Subsequently, a reinforcement learning-based neural network (NN) controller is proposed for the affine-like nonlinear error dynamic system. The control scheme consists of two NNs. One NN is designated as the critic, which approximates a predefined long-term cost function, whereas an action NN is employed to derive a control signal for the system to track a desired trajectory while minimizing the cost function simultaneously. Offline NN training is not required and online NN weight tuning rules are derived. By using the standard Lyapunov approach, the uniformly ultimate boundedness (UUB) of the tracking error and weight estimates is demonstrated.
Keywords :
Lyapunov methods; autoregressive moving average processes; control system synthesis; discrete time systems; learning systems; neurocontrollers; nonlinear control systems; performance index; Lyapunov approach; NARMAX model; affine-like representation; bounded disturbance; control signal; controller design; cost function; error dynamics tracking; neural network; nonaffine nonlinear discrete time systems; nonlinear autoregressive moving average with exogenous input model; nonlinear function; online reinforcement learning control; trajectory tracking; uniformly ultimate boundedness; weight estimate; Control systems; Cost function; Discrete time systems; Error correction; Learning; Neural networks; Nonlinear control systems; Nonlinear dynamical systems; Signal design; Trajectory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control, 2007 46th IEEE Conference on
Conference_Location :
New Orleans, LA
ISSN :
0191-2216
Print_ISBN :
978-1-4244-1497-0
Electronic_ISBN :
0191-2216
Type :
conf
DOI :
10.1109/CDC.2007.4434959
Filename :
4434959
Link To Document :
بازگشت