DocumentCode
1227319
Title
Reinforcement Learning Neural-Network-Based Controller for Nonlinear Discrete-Time Systems With Input Constraints
Author
He, Pingan ; Jagannathan, S.
Author_Institution
Dept. of Electr. & Comput. Eng., Missouri Univ., Rolla, MO
Volume
37
Issue
2
fYear
2007
fDate
4/1/2007 12:00:00 AM
Firstpage
425
Lastpage
436
Abstract
A novel adaptive-critic-based neural network (NN) controller in discrete time is designed to deliver a desired tracking performance for a class of nonlinear systems in the presence of actuator constraints. The constraints of the actuator are treated in the controller design as the saturation nonlinearity. The adaptive critic NN controller architecture based on state feedback includes two NNs: the critic NN is used to approximate the "strategic" utility function, whereas the action NN is employed to minimize both the strategic utility function and the unknown nonlinear dynamic estimation errors. The critic and action NN weight updates are derived by minimizing certain quadratic performance indexes. Using the Lyapunov approach and with novel weight updates, the uniformly ultimate boundedness of the closed-loop tracking error and weight estimates is shown in the presence of NN approximation errors and bounded unknown disturbances. The proposed NN controller works in the presence of multiple nonlinearities, unlike other schemes that normally approximate one nonlinearity. Moreover, the adaptive critic NN controller does not require an explicit offline training phase, and the NN weights can be initialized at zero or random. Simulation results justify the theoretical analysis
Keywords
Lyapunov methods; closed loop systems; control system synthesis; discrete time systems; feedback; learning (artificial intelligence); neurocontrollers; nonlinear systems; Lyapunov approach; closed-loop tracking error; neural-network-based controller; nonlinear discrete-time system; quadratic performance; reinforcement learning; state feedback control; Actuators; Adaptive control; Control systems; Estimation error; Learning; Neural networks; Nonlinear control systems; Nonlinear systems; Programmable control; State feedback; Approximate dynamic programming; neural network control; o ptimal control; reinforcement learning; Algorithms; Artificial Intelligence; Computer Simulation; Feedback; Models, Theoretical; Neural Networks (Computer); Nonlinear Dynamics; Pattern Recognition, Automated; Reinforcement (Psychology); Signal Processing, Computer-Assisted;
fLanguage
English
Journal_Title
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
1083-4419
Type
jour
DOI
10.1109/TSMCB.2006.883869
Filename
4126287
Link To Document