DocumentCode :
3269437
Title :
Optimistic planning for continuous-action deterministic systems
Author :
Busoniu, L. ; Daniels, Andrew ; Munos, Remi ; Babuska, Robert
Author_Institution :
CRAN, Univ. de Lorraine, Vandoeuvre les Nancy, France
fYear :
2013
fDate :
16-19 April 2013
Firstpage :
69
Lastpage :
76
Abstract :
We consider the class of online planning algorithms for optimal control, which compared to dynamic programming are relatively unaffected by large state dimensionality. We introduce a novel planning algorithm called SOOP that works for deterministic systems with continuous states and actions. SOOP is the first method to explore the true solution space, consisting of infinite sequences of continuous actions, without requiring knowledge about the smoothness of the system. SOOP can be used parameter-free at the cost of more model calls, but we also propose a more practical variant tuned by a parameter α, which balances finer discretization with longer planning horizons. Experiments on three problems show SOOP reliably ranks among the best algorithms, fully dominating competing methods when the problem requires both long horizons and fine discretization.
Keywords :
Markov processes; dynamic programming; optimal control; Markov decision process; SOOP; continuous-action deterministic systems; dynamic programming; online planning algorithm; optimal control; optimistic planning; Aerospace electronics; Dynamic programming; Heuristic algorithms; Measurement; Optimization; Planning; Upper bound;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2013 IEEE Symposium on
Conference_Location :
Singapore
ISSN :
2325-1824
Type :
conf
DOI :
10.1109/ADPRL.2013.6614991
Filename :
6614991
Link To Document :
بازگشت