مرکز منطقه ای اطلاع رساني علوم و فناوري - A potential-based method for finite-stage Markov Decision Process

DocumentCode :

2394243

Title :

A potential-based method for finite-stage Markov Decision Process

Author :

Jia, Qing-Shan

Author_Institution :

Dept. of Autom., Tsinghua Univ., Beijing

fYear :

2008

fDate :

11-13 June 2008

Firstpage :

5029

Lastpage :

5034

Abstract :

Finite-stage Markov decision process (MDP) supplies a general framework for many practical problems when only the performance in a finite duration is of interest. Dynamic programming (DP) supplies a general way to find the optimal policies but is usually practically infeasible, due to the exponentially increasing policy space. Approximating the finite-stage MDP by an infinite-stage MDP reduces the search space but usually does not find the optimal stationary policy, due to the approximation error. We develop a method that finds the optimal stationary policies for the finite-stage MDP. The method is based on performance potentials, which can be estimated through sample paths and thus suits practical application.

Keywords :

Markov processes; dynamic programming; dynamic programming; finite-stage Markov decision process; optimal stationary policy; potential-based method; Approximation error; Decision making; Dynamic programming; Educational institutions; Space stations; Space technology; State estimation; State-space methods; USA Councils; Uncertainty; Performance potentials; finite-stage Markov Decision Processes; policy iteration; stationary policy;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

American Control Conference, 2008

Conference_Location :

Seattle, WA

ISSN :

0743-1619

Print_ISBN :

978-1-4244-2078-0

Electronic_ISBN :

0743-1619

Type :

conf

DOI :

10.1109/ACC.2008.4587291

Filename :

4587291

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2394243