Title :
Data-Driven
Control for Nonlinear Distributed Parameter Systems
Author :
Biao Luo ; Tingwen Huang ; Huai-Ning Wu ; Xiong Yang
Author_Institution :
Sch. of Autom. & Electr. Eng., Univ. of Sci. & Technol. Beijing, Beijing, China
Abstract :
The data-driven H∞ control problem of nonlinear distributed parameter systems is considered in this paper. An off-policy learning method is developed to learn the H∞ control policy from real system data rather than the mathematical model. First, Karhunen-Loève decomposition is used to compute the empirical eigenfunctions, which are then employed to derive a reduced-order model (ROM) of slow subsystem based on the singular perturbation theory. The H∞ control problem is reformulated based on the ROM, which can be transformed to solve the Hamilton-Jacobi-Isaacs (HJI) equation, theoretically. To learn the solution of the HJI equation from real system data, a data-driven off-policy learning approach is proposed based on the simultaneous policy update algorithm and its convergence is proved. For implementation purpose, a neural network (NN)- based action-critic structure is developed, where a critic NN and two action NNs are employed to approximate the value function, control, and disturbance policies, respectively. Subsequently, a least-square NN weight-tuning rule is derived with the method of weighted residuals. Finally, the developed data-driven off-policy learning approach is applied to a nonlinear diffusion-reaction process, and the obtained results demonstrate its effectiveness.
Keywords :
H∞ control; Karhunen-Loeve transforms; convergence of numerical methods; distributed parameter systems; eigenvalues and eigenfunctions; learning systems; least squares approximations; neurocontrollers; nonlinear control systems; reduced order systems; HJI equation; Hamilton-Jacobi-Isaacs equation; Karhunen-Loève decomposition; ROM; convergence; data-driven H∞ control problem; diffusion-reaction process; disturbance policies; empirical eigenfunctions; least-square NN weight-tuning rule; neural network based action-critic structure; nonlinear distributed parameter systems; off-policy learning method; real system data; reduced-order model; simultaneous policy update algorithm; singular perturbation theory; slow subsystem; value function; weighted residuals; Approximation algorithms; Artificial neural networks; Attenuation; Computational modeling; Mathematical model; Read only memory; Reduced order systems; $ H_infty $ control; Data driven; H∞ control; Hamilton-Jacobi-Isaacs (HJI) equation; Hamilton???Jacobi???Isaacs (HJI) equation; distributed parameter systems (DSPs); neural network (NN); off-policy learning; partial differential equation (PDE); partial differential equation (PDE).;
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
DOI :
10.1109/TNNLS.2015.2461023