• DocumentCode
    2830872
  • Title

    Online reinforcement learning control of unknown nonaffine nonlinear discrete time systems

  • Author

    Yang, Qinmin ; Jagannathan, S.

  • Author_Institution
    Univ. of Missouri-Rolla, Rolla
  • fYear
    2007
  • fDate
    12-14 Dec. 2007
  • Firstpage
    5942
  • Lastpage
    5947
  • Abstract
    In this paper, a novel neural network (NN) based online reinforcement learning controller is designed for nonaffine nonlinear discrete-time systems with bounded disturbances. The nonaffine systems are represented by nonlinear auto regressive moving average with exogenous input (NARMAX) model with unknown nonlinear functions. An equivalent affine-like representation for the tracking error dynamics is developed first from the original nonaffine system. Subsequently, a reinforcement learning-based neural network (NN) controller is proposed for the affine-like nonlinear error dynamic system. The control scheme consists of two NNs. One NN is designated as the critic, which approximates a predefined long-term cost function, whereas an action NN is employed to derive a control signal for the system to track a desired trajectory while minimizing the cost function simultaneously. Offline NN training is not required and online NN weight tuning rules are derived. By using the standard Lyapunov approach, the uniformly ultimate boundedness (UUB) of the tracking error and weight estimates is demonstrated.
  • Keywords
    Lyapunov methods; autoregressive moving average processes; control system synthesis; discrete time systems; learning systems; neurocontrollers; nonlinear control systems; performance index; Lyapunov approach; NARMAX model; affine-like representation; bounded disturbance; control signal; controller design; cost function; error dynamics tracking; neural network; nonaffine nonlinear discrete time systems; nonlinear autoregressive moving average with exogenous input model; nonlinear function; online reinforcement learning control; trajectory tracking; uniformly ultimate boundedness; weight estimate; Control systems; Cost function; Discrete time systems; Error correction; Learning; Neural networks; Nonlinear control systems; Nonlinear dynamical systems; Signal design; Trajectory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 2007 46th IEEE Conference on
  • Conference_Location
    New Orleans, LA
  • ISSN
    0191-2216
  • Print_ISBN
    978-1-4244-1497-0
  • Electronic_ISBN
    0191-2216
  • Type

    conf

  • DOI
    10.1109/CDC.2007.4434959
  • Filename
    4434959