مرکز منطقه ای اطلاع رساني علوم و فناوري - Standard dynamic programming applied to time aggregated Markov decision processes

DocumentCode :

3314421

Title :

Standard dynamic programming applied to time aggregated Markov decision processes

Author :

Arruda, Edilson F. ; Fragoso, Marcelo D.

Author_Institution :

Sch. of Eng., Pontifical Catholic Univ. of Rio Grande do Sul, Porto Alegre, Brazil

fYear :

2009

fDate :

15-18 Dec. 2009

Firstpage :

2576

Lastpage :

2580

Abstract :

In this note we address the time aggregation approach to ergodic finite state Markov decision processes with uncontrollable states. We propose the use of the time aggregation approach as an intermediate step toward constructing a transformed MDP whose state space is comprised solely of the controllable states. The proposed approach simplifies the iterative search for the optimal solution by eliminating the need to define an equivalent parametric function, and results in a problem that can be solved by simpler, standard MDP algorithms.

Keywords :

Markov processes; decision theory; dynamic programming; ergodic finite state Markov decision process; iterative search; optimal solution; parametric function; standard dynamic programming; time aggregated Markov decision process; Costs; Decision making; Dynamic programming; Iterative algorithms; Iterative methods; Learning; Process control; Random processes; Solid modeling; State-space methods; Dynamic Programing; Markov Decision Processes; Time Aggregation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control, 2009 held jointly with the 2009 28th Chinese Control Conference. CDC/CCC 2009. Proceedings of the 48th IEEE Conference on

Conference_Location :

Shanghai

ISSN :

0191-2216

Print_ISBN :

978-1-4244-3871-6

Electronic_ISBN :

0191-2216

Type :

conf

DOI :

10.1109/CDC.2009.5400692

Filename :

5400692

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3314421