Title :
On generalized policy iteration for continuous-time linear systems
Author :
Lee, Jae Young ; Chun, Tae Yoon ; Park, Jin Bae ; Choi, Yoon Ho
Author_Institution :
Dept. of Electr. & Electron. Eng., Yonsei Univ., Seoul, South Korea
Abstract :
This paper investigates the mathematical properties of generalized policy iteration (GPI) applied to a class of continuous-time linear systems with unknown internal dynamics. GPI is a class of dynamic programming (DP) methods that solve an optimal control problem by using two consecutive steps: policy evaluation and policy improvement. We first provide several formulas equivalent to GPI and, as a result, reveal its relation to linear quadratic optimal control problems and the fact that the computational complexity due to backup operations in the policy evaluation steps can be lessened by increasing the time horizon of GPI. A variety of local stability and convergence criteria are also provided, along with their connection to the convergence speed. Finally, several numerical simulations are performed to verify the results.
Keywords :
computational complexity; continuous time systems; convergence of numerical methods; dynamic programming; iterative methods; linear quadratic control; linear systems; stability; GPI time horizon; backup operations; continuous-time linear systems; convergence criteria; dynamic programming method; generalized policy iteration; linear quadratic optimal control problems; local stability; mathematical properties; numerical simulations; policy evaluation steps; policy improvement; unknown internal dynamics; Europe
Conference_Title :
Decision and Control and European Control Conference (CDC-ECC), 2011 50th IEEE Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-61284-800-6
ISSN :
0743-1546
DOI :
10.1109/CDC.2011.6161462