• DocumentCode
    808697
  • Title

    Control of Nonaffine Nonlinear Discrete-Time Systems Using Reinforcement-Learning-Based Linearly Parameterized Neural Networks

  • Author

    Yang, Qinmin ; Vance, Jonathan Blake ; Jagannathan, S.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Missouri Univ. of Sci. & Technol., Rolla, MO
  • Volume
    38
  • Issue
    4
  • fYear
    2008
  • Firstpage
    994
  • Lastpage
    1001
  • Abstract
    A nonaffine discrete-time system represented by the nonlinear autoregressive moving average with eXogenous input (NARMAX) representation with unknown nonlinear system dynamics is considered. An equivalent affinelike representation in terms of the tracking error dynamics is first obtained from the original nonaffine nonlinear discrete-time system so that reinforcement-learning-based near-optimal neural network (NN) controller can be developed. The control scheme consists of two linearly parameterized NNs. One NN is designated as the critic NN, which approximates a predefined long-term cost function, and an action NN is employed to derive a near-optimal control signal for the system to track a desired trajectory while minimizing the cost function simultaneously. The NN weights are tuned online. By using the standard Lyapunov approach, the stability of the closed-loop system is shown. The net result is a supervised actor-critic NN controller scheme which can be applied to a general nonaffine nonlinear discrete-time system without needing the affinelike representation. Simulation results demonstrate satisfactory performance of the controller.
  • Keywords
    Lyapunov methods; autoregressive moving average processes; closed loop systems; discrete time systems; learning (artificial intelligence); linear systems; minimisation; neurocontrollers; nonlinear control systems; optimal control; stability; tracking; Lyapunov approach; closed-loop system stability; cost function minimization; linear parameterized neural network; near-optimal control signal; nonaffine nonlinear discrete-time system control; nonlinear autoregressive moving average-exogenous input; nonlinear system dynamics; reinforcement-learning; tracking error dynamics; Control systems; Cost function; Error correction; Neural networks; Nonlinear control systems; Nonlinear dynamical systems; Nonlinear systems; Signal design; Stability; Trajectory; Adaptive critic; Lyapunov stability; adaptive dynamic programming; neural network control; reinforcement learning control; Algorithms; Computer Simulation; Feedback; Models, Theoretical; Neural Networks (Computer); Nonlinear Dynamics; Programming, Linear; Reinforcement (Psychology); Signal Processing, Computer-Assisted;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/TSMCB.2008.926607
  • Filename
    4567550