Reinforcement Learning Neural-Network-Based Controller for Nonlinear Discrete-Time Systems With Input Constraints

Author

He, Pingan ; Jagannathan, S.

Author_Institution

Dept. of Electr. & Comput. Eng., Missouri Univ., Rolla, MO

Volume

37

Issue

2

fYear

2007

fDate

4/1/2007 12:00:00 AM

Firstpage

425

Lastpage

436

Abstract

A novel adaptive-critic-based neural network (NN) controller in discrete time is designed to deliver a desired tracking performance for a class of nonlinear systems in the presence of actuator constraints. The constraints of the actuator are treated in the controller design as the saturation nonlinearity. The adaptive critic NN controller architecture based on state feedback includes two NNs: the critic NN is used to approximate the "strategic" utility function, whereas the action NN is employed to minimize both the strategic utility function and the unknown nonlinear dynamic estimation errors. The critic and action NN weight updates are derived by minimizing certain quadratic performance indexes. Using the Lyapunov approach and with novel weight updates, the uniformly ultimate boundedness of the closed-loop tracking error and weight estimates is shown in the presence of NN approximation errors and bounded unknown disturbances. The proposed NN controller works in the presence of multiple nonlinearities, unlike other schemes that normally approximate one nonlinearity. Moreover, the adaptive critic NN controller does not require an explicit offline training phase, and the NN weights can be initialized at zero or random. Simulation results justify the theoretical analysis

Keywords

Lyapunov methods; closed loop systems; control system synthesis; discrete time systems; feedback; learning (artificial intelligence); neurocontrollers; nonlinear systems; Lyapunov approach; closed-loop tracking error; neural-network-based controller; nonlinear discrete-time system; quadratic performance; reinforcement learning; state feedback control; Actuators; Adaptive control; Control systems; Estimation error; Learning; Neural networks; Nonlinear control systems; Nonlinear systems; Programmable control; State feedback; Approximate dynamic programming; neural network control; o ptimal control; reinforcement learning; Algorithms; Artificial Intelligence; Computer Simulation; Feedback; Models, Theoretical; Neural Networks (Computer); Nonlinear Dynamics; Pattern Recognition, Automated; Reinforcement (Psychology); Signal Processing, Computer-Assisted;

fLanguage

English

Journal_Title

Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on

Publisher

ieee

ISSN

1083-4419

Type

jour

DOI

10.1109/TSMCB.2006.883869

Filename

4126287