Title :
Adaptive Suboptimal Output-Feedback Control for Linear Systems Using Integral Reinforcement Learning
Author :
Zhu, Lemei M. ; Modares, Hamidreza ; Peen, Gan Oon ; Lewis, Frank L. ; Baozeng Yue
Author_Institution :
Dept. of Basic Courses, North China Inst. of Sci. & Technol., Langfang, China
Abstract :
Reinforcement learning (RL) techniques have been successfully used to find optimal state-feedback controllers for continuous-time (CT) systems. However, in most real-world control applications, it is not practical to measure the system states and it is desirable to design output-feedback controllers. This paper develops an online learning algorithm based on the integral RL (IRL) technique to find a suboptimal output-feedback controller for partially unknown CT linear systems. The proposed IRL-based algorithm solves an IRL Bellman equation in each iteration online in real time to evaluate an output-feedback policy and updates the output-feedback gain using the information given by the evaluated policy. The knowledge of the system drift dynamics is not required by the proposed method. An adaptive observer is used to provide the knowledge of the full states for the IRL Bellman equation during learning. However, the observer is not needed after the learning process is finished. The convergence of the proposed algorithm to a suboptimal output-feedback solution and the performance of the proposed method are verified through simulation on two real-world applications, namely, the X-Y table and the F-16 aircraft.
Keywords :
adaptive control; continuous time systems; learning (artificial intelligence); learning systems; linear systems; observers; state feedback; suboptimal control; CT systems; F-16 aircraft; IRL Bellman equation; IRL technique; IRL-based algorithm; X-Y table; adaptive suboptimal output-feedback control; continuous-time systems; integral reinforcement learning technique; linear systems; online learning algorithm; optimal state-feedback controllers; output-feedback controllers; output-feedback gain; output-feedback policy; partially unknown CT linear systems; suboptimal output-feedback solution; Control systems; Convergence; Equations; Heuristic algorithms; Linear systems; Mathematical model; Observers; Integral reinforcement learning (IRL); linear continuous-time (CT) systems; optimal control; output feedback; output feedback.;
Journal_Title :
Control Systems Technology, IEEE Transactions on
DOI :
10.1109/TCST.2014.2322778