مرکز منطقه ای اطلاع رساني علوم و فناوري - Optimistic planning for continuous-action deterministic systems

DocumentCode :

3269437

Title :

Optimistic planning for continuous-action deterministic systems

Author :

Busoniu, L. ; Daniels, Andrew ; Munos, Remi ; Babuska, Robert

Author_Institution :

CRAN, Univ. de Lorraine, Vandoeuvre les Nancy, France

fYear :

2013

fDate :

16-19 April 2013

Firstpage :

Lastpage :

Abstract :

We consider the class of online planning algorithms for optimal control, which compared to dynamic programming are relatively unaffected by large state dimensionality. We introduce a novel planning algorithm called SOOP that works for deterministic systems with continuous states and actions. SOOP is the first method to explore the true solution space, consisting of infinite sequences of continuous actions, without requiring knowledge about the smoothness of the system. SOOP can be used parameter-free at the cost of more model calls, but we also propose a more practical variant tuned by a parameter α, which balances finer discretization with longer planning horizons. Experiments on three problems show SOOP reliably ranks among the best algorithms, fully dominating competing methods when the problem requires both long horizons and fine discretization.

Keywords :

Markov processes; dynamic programming; optimal control; Markov decision process; SOOP; continuous-action deterministic systems; dynamic programming; online planning algorithm; optimal control; optimistic planning; Aerospace electronics; Dynamic programming; Heuristic algorithms; Measurement; Optimization; Planning; Upper bound;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2013 IEEE Symposium on

Conference_Location :

Singapore

ISSN :

2325-1824

Type :

conf

DOI :

10.1109/ADPRL.2013.6614991

Filename :

6614991

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3269437