Title :
Reinforcement Learning Output Feedback NN Control Using Deterministic Learning Technique
Author :
Bin Xu ; Chenguang Yang ; Zhongke Shi
Author_Institution :
Sch. of Autom., Northwestern Polytech. Univ., Xi´an, China
Abstract :
In this brief, a novel adaptive-critic-based neural network (NN) controller is investigated for nonlinear pure-feedback systems. The controller design is based on the transformed predictor form, and the actor-critic NN control architecture includes two NNs, whereas the critic NN is used to approximate the strategic utility function, and the action NN is employed to minimize both the strategic utility function and the tracking error. A deterministic learning technique has been employed to guarantee that the partial persistent excitation condition of internal states is satisfied during tracking control to a periodic reference orbit. The uniformly ultimate boundedness of closed-loop signals is shown via Lyapunov stability analysis. Simulation results are presented to demonstrate the effectiveness of the proposed control.
Keywords :
Lyapunov methods; adaptive control; control system synthesis; feedback; learning (artificial intelligence); neurocontrollers; nonlinear control systems; stability; Lyapunov stability analysis; actor-critic NN control architecture; adaptive-critic-based neural network controller; closed-loop signal uniformly ultimate boundedness; controller design; deterministic learning technique; internal states; nonlinear pure-feedback systems; partial persistent excitation condition; periodic reference orbit tracking control; predictor form; reinforcement learning output feedback NN control; strategic utility function; Approximation methods; Artificial neural networks; Discrete-time systems; Learning systems; Nonlinear systems; Output feedback; Approximate dynamic programming; discrete-time system; output feedback control; pure-feedback system; radial basis function neural network (RBF NN);
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
DOI :
10.1109/TNNLS.2013.2292704