مرکز منطقه ای اطلاع رساني علوم و فناوري - Construction of embedded Markov decision processes for optimal control of non-linear systems with continuous state spaces

DocumentCode :

3443304

Title :

Construction of embedded Markov decision processes for optimal control of non-linear systems with continuous state spaces

Author :

Nikovski, Daniel ; Esenther, Alan

Author_Institution :

Mitsubishi Electr. Res. Labs., Cambridge, MA, USA

fYear :

2011

fDate :

12-15 Dec. 2011

Firstpage :

7944

Lastpage :

7949

Abstract :

We consider the problem of constructing a suitable discrete-state approximation of an arbitrary non-linear dynamical system with continuous state space and discrete control actions that would allow close to optimal sequential control of that system by means of value or policy iteration on the approximated model. We propose a method for approximating the continuous dynamics by means of an embedded Markov decision process (MDP) model defined over an arbitrary set of discrete states sampled from the original continuous state space. The mathematical similarity between sets of barycentric coordinates (convex combinations) and probability mass functions is exploited to compute the transition matrices and initial state distribution of the MDP. Barycentric coordinates are computed efficiently on a Delaunay triangulation of the set of discrete states, ensuring maximal accuracy of the approximation and the resulting control policy.

Keywords :

Markov processes; approximation theory; decision making; embedded systems; mesh generation; nonlinear control systems; nonlinear dynamical systems; optimal control; probability; state-space methods; Delaunay triangulation; arbitrary nonlinear dynamical system; barycentric coordinates; continuous state spaces; control policy; discrete control actions; discrete state approximation; embedded Markov decision process model; initial state distribution; mathematical similarity; optimal control; optimal sequential control; policy iteration; probability mass functions; transition matrices; Aerospace electronics; Heuristic algorithms; Learning; Markov processes; Optimal control; Trajectory; Vectors; Markov decision process models; dynamic programming; embedded Markov chains; optimal control;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control and European Control Conference (CDC-ECC), 2011 50th IEEE Conference on

Conference_Location :

Orlando, FL

ISSN :

0743-1546

Print_ISBN :

978-1-61284-800-6

Electronic_ISBN :

0743-1546

Type :

conf

DOI :

10.1109/CDC.2011.6161310

Filename :

6161310

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3443304