DocumentCode
2391971
Title
An application of convex optimization concepts to approximate dynamic programming
Author
Arruda, Edilson F. ; Fragoso, Marcelo D. ; Val, João Bosco R Do
Author_Institution
Dept. of Syst. & Control, Nat. Lab. for Sci. Comput., Petropolis
fYear
2008
fDate
11-13 June 2008
Firstpage
4238
Lastpage
4243
Abstract
This paper deals with approximate value iteration (AVI) algorithms applied to discounted dynamic (DP) programming problems. The so-called Bellman residual is shown to be convex in the Banach space of candidate solutions to the DP problem. This fact motivates the introduction of an AVI algorithm with local search that seeks an approximate solution in a lower dimensional space called approximation architecture. The optimality of a point in the approximation architecture is characterized by means of convex optimization concepts and necessary and sufficient conditions to global optimality are derived. To illustrate the method, two examples are presented which were previously explored in the literature.
Keywords
Banach spaces; approximation theory; convex programming; dynamic programming; iterative methods; Banach space; Bellman residual; approximate value iteration algorithm; convex optimization concept; dynamic programming; Approximation algorithms; Computer architecture; Convergence; Dynamic programming; Function approximation; Heuristic algorithms; Large-scale systems; Robustness; Stochastic processes; Sufficient conditions;
fLanguage
English
Publisher
ieee
Conference_Titel
American Control Conference, 2008
Conference_Location
Seattle, WA
ISSN
0743-1619
Print_ISBN
978-1-4244-2078-0
Electronic_ISBN
0743-1619
Type
conf
DOI
10.1109/ACC.2008.4587159
Filename
4587159
Link To Document