مرکز منطقه ای اطلاع رساني علوم و فناوري - Error bound analysis of policy iteration based approximate dynamic programming for deterministic discrete-time nonlinear systems

DocumentCode :

3661469

Title :

Error bound analysis of policy iteration based approximate dynamic programming for deterministic discrete-time nonlinear systems

Author :

Wentao Guo;Feng Liu;Jennie Si;Shengwei Mei;Rui Li

Author_Institution :

State Key Laboratory of Power Systems, Department of Electrical Engineering, Tsinghua University, Beijing 100084, China

fYear :

2015

fDate :

7/1/2015 12:00:00 AM

Firstpage :

Lastpage :

Abstract :

Extensive approximate dynamic programming (ADP) algorithms have been developed based on policy iteration. For policy iteration based ADP of deterministic discrete-time nonlinear systems, existing literature has proved its convergence in the formulation of undiscounted value function under the assumption of exact approximation. Furthermore, the error bound of policy iteration based ADP has been analyzed in a discounted value function formulation with consideration of approximation errors. However, there has not been any error bound analysis of policy iteration based ADP in the undiscounted value function formulation with consideration of approximation errors. In this paper, we intend to fill this theoretical gap. We provide a sufficient condition on the approximation error, so that the iterative value function can be bounded in a neighbourhood of the optimal value function. To the best of the authors´ knowledge, this is the first error bound result of the undiscounted policy iteration for deterministic discrete-time nonlinear systems considering approximation errors.

Keywords :

"Approximation methods","Mathematical model","Approximation algorithms"

Publisher :

ieee

Conference_Titel :

Neural Networks (IJCNN), 2015 International Joint Conference on

Electronic_ISBN :

2161-4407

Type :

conf

DOI :

10.1109/IJCNN.2015.7280783

Filename :

7280783

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3661469