DocumentCode :
116304
Title :
Bounds for approximate dynamic programming based on string optimization and curvature
Author :
Yajing Liu ; Chong, Edwin K. P. ; Pezeshki, Ali ; Moran, Bill
Author_Institution :
Dept. of Electr. & Comput. Eng., Colorado State Univ., Fort Collins, CO, USA
fYear :
2014
fDate :
15-17 Dec. 2014
Firstpage :
6653
Lastpage :
6658
Abstract :
In this paper, we will develop a systematic approach to deriving guaranteed bounds for approximate dynamic programming (ADP) schemes in optimal control problems. Our approach is inspired by our recent results on bounding the performance of greedy strategies in optimization of string functions over a finite horizon. The approach is to derive a string-optimization problem, for which the optimal strategy is the optimal control solution and the greedy strategy is the ADP solution. Using this approach, we show that any ADP solution achieves a performance that is at least a factor of β of the performance of the optimal control solution, characterized by Bellman´s optimality principle. The factor β depends on the specific ADP scheme, as we will explicitly characterize. To illustrate the applicability of our bounding technique, we present examples of ADP schemes, including the popular rollout method.
Keywords :
approximation theory; dynamic programming; greedy algorithms; optimal control; Bellman´s optimality principle; approximate dynamic programming; bounding technique; curvature; finite horizon; greedy strategies; optimal control problems; rollout method; string function optimization; string optimization; string-optimization problem; Approximation methods; Dynamic programming; Educational institutions; Linear programming; Optimal control; Optimization; Systematics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control (CDC), 2014 IEEE 53rd Annual Conference on
Conference_Location :
Los Angeles, CA
Print_ISBN :
978-1-4799-7746-8
Type :
conf
DOI :
10.1109/CDC.2014.7040433
Filename :
7040433
Link To Document :
بازگشت