مرکز منطقه ای اطلاع رساني علوم و فناوري - Toward an optimized value iteration algorithm for average cost Markov decision processes

DocumentCode :

2579845

Title :

Toward an optimized value iteration algorithm for average cost Markov decision processes

Author :

Arruda, Edilson F. ; Ourique, Fabrício ; Almudevar, Anthony

Author_Institution :

Sch. of Eng., Pontifical Catholic Univ. of Rio Grande do Sul, Porto Alegre, Brazil

fYear :

2010

fDate :

15-17 Dec. 2010

Firstpage :

930

Lastpage :

934

Abstract :

In this paper we propose a technique to accelerate the convergence rate of the value iteration (VI) algorithm applied to discrete average cost Markov decision processes (MDP). The convergence rate is measured with respect to the total computational effort instead of the iteration counter. Such a rate definition makes it possible to compare different classes of algorithms, which employ distinct and possibly variable updating schemes. A partial information value iteration (PIVI) algorithm is proposed that updates an increasingly accurate approximate version of the original problem with a view toward saving computations at the early stages of the algorithm, when one is typically far from the optimal solution. The PIVI overall computational effort is compared with that of the classical VI algorithm for a broad set of parameters. The results suggest that a suitable choice of parameters can lead to significant computational savings in the process of finding the optimal solution for discrete MDP under the average cost criterion.

Keywords :

Markov processes; costing; decision theory; iterative methods; optimisation; average cost Markov decision process; convergence rate; iteration counter; optimized value iteration algorithm; partial information value iteration algorithm; Approximation algorithms; Computational modeling; Convergence; Cost function; Dynamic programming; Markov processes; Signal processing algorithms; Average Cost; Computational Effort; Markov Decision Processes; Value Iteration;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control (CDC), 2010 49th IEEE Conference on

Conference_Location :

Atlanta, GA

ISSN :

0743-1546

Print_ISBN :

978-1-4244-7745-6

Type :

conf

DOI :

10.1109/CDC.2010.5717895

Filename :

5717895

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2579845