Online Adaptive Policy Learning Algorithm for $H_{\\infty }$ State Feedback Control of Unknown Affine Nonlinear Discrete-Time Systems

Author

Huaguang Zhang ; Chunbin Qin ; Bin Jiang ; Yanhong Luo

Author_Institution

Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang, China

Volume

44

Issue

12

fYear

2014

fDate

Dec. 2014

Firstpage

2706

Lastpage

2718

Abstract

The problem of H_∞ state feedback control of affine nonlinear discrete-time systems with unknown dynamics is investigated in this paper. An online adaptive policy learning algorithm (APLA) based on adaptive dynamic programming (ADP) is proposed for learning in real-time the solution to the Hamilton-Jacobi-Isaacs (HJI) equation, which appears in the H_∞ control problem. In the proposed algorithm, three neural networks (NNs) are utilized to find suitable approximations of the optimal value function and the saddle point feedback control and disturbance policies. Novel weight updating laws are given to tune the critic, actor, and disturbance NNs simultaneously by using data generated in real-time along the system trajectories. Considering NN approximation errors, we provide the stability analysis of the proposed algorithm with Lyapunov approach. Moreover, the need of the system input dynamics for the proposed algorithm is relaxed by using a NN identification scheme. Finally, simulation examples show the effectiveness of the proposed algorithm.

Keywords

H^∞ control; Lyapunov methods; adaptive control; discrete time systems; dynamic programming; function approximation; learning systems; neurocontrollers; nonlinear control systems; stability; state feedback; ADP; APLA; H_∞ state feedback control; HJI equation; Hamilton-Jacobi-Isaacs equation; Lyapunov approach; NN approximation errors; NN identification scheme; adaptive dynamic programming; disturbance NNs; neural networks; online adaptive policy learning algorithm; optimal value function approximations; saddle point feedback control; stability analysis; unknown affine nonlinear discrete-time systems; weight updating laws; Adaptive systems; Discrete-time systems; Dynamic programming; H infinity control; Neural networks; (H_{infty }) control; Adaptive dynamic programming; H∞ control; neural networks; nonlinear discrete-time system; zero-sum game;

fLanguage

English

Journal_Title

Cybernetics, IEEE Transactions on

Publisher

ieee

ISSN

2168-2267

Type

jour

DOI

10.1109/TCYB.2014.2313915

Filename

6866861

Online Adaptive Policy Learning Algorithm for State Feedback Control of Unknown Affine Nonlinear Discrete-Time Systems

Huaguang Zhang ; Chunbin Qin ; Bin Jiang ; Yanhong Luo

jour

Online Adaptive Policy Learning Algorithm for $H_{\\infty }$ State Feedback Control of Unknown Affine Nonlinear Discrete-Time Systems