Author :
Liang, Tian ; Sun, Jigui ; Yin, Minghao
Abstract :
Real-time dynamic programming (RTDP) is an outstanding real-time algorithm for solving non-deterministic planning problems with full observability. RTDP has two key advantages compared with other DP algorithms: first, it obtains an optimal policy without computing the whole state space; second, it has good anytime behavior. However, RTDP's convergence is slow. In this paper, we introduce RTDP(k), an algorithm based on RTDP with a similar structure. RTDP(k) improves convergence while retaining RTDP's real-time properties. RTDP(k) updates k extended states per iteration following a "bounded propagation strategy". For Markov decision processes (MDPs), and in particular stochastic shortest-path problems (SSPs), we prove two results: first, every RTDP(k) trial terminates in a finite number of steps; second, RTDP(k) eventually converges to an optimal policy. From a practical point of view, we show that RTDP(k) produces better solutions in the first trial and converges faster than RTDP on benchmarks for real-time search.
Keywords :
Markov processes; decision theory; dynamic programming; graph theory; iterative methods; planning; search problems; stochastic programming; Markov decision processes; bounded propagation strategy; iteration; nondeterministic planning problems; optimal policy; real-time dynamic programming; real-time search; stochastic shortest-path problems; Convergence; Cost function; Dynamic programming; Educational institutions; Labeling; Observability; Probability distribution; State-space methods; Stochastic processes
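The sketch below illustrates the kind of procedure the abstract describes: an RTDP-style greedy trial on a stochastic shortest-path problem, where each step also propagates Bellman updates to a bounded number of extra states. The toy SSP, the use of predecessor states as the "extended" states, and all names are illustrative assumptions, not the paper's exact RTDP(k) formulation.

import random
from collections import defaultdict

# Hypothetical SSP: transitions[s][a] = list of (probability, next_state);
# every action costs 1, and "goal" is absorbing with zero cost.
transitions = {
    "s0": {"a": [(0.8, "s1"), (0.2, "s0")], "b": [(1.0, "s2")]},
    "s1": {"a": [(1.0, "goal")]},
    "s2": {"a": [(0.5, "s1"), (0.5, "s2")]},
    "goal": {},
}
COST = 1.0
V = defaultdict(float)  # admissible heuristic: V(s) = 0 everywhere

def q_value(s, a):
    """Expected cost of taking action a in s under the current value function."""
    return COST + sum(p * V[ns] for p, ns in transitions[s][a])

def bellman_update(s):
    """Greedy backup: set V(s) to the minimum Q-value; return the greedy action."""
    if not transitions[s]:                      # goal state: nothing to update
        return None
    best_a = min(transitions[s], key=lambda a: q_value(s, a))
    V[s] = q_value(s, best_a)
    return best_a

def predecessors(s):
    """States from which some action can reach s (used here as the extended states)."""
    return [p for p, acts in transitions.items()
            if any(ns == s for a in acts for _, ns in acts[a])]

def rtdp_k_trial(start, k):
    """One trial: greedy simulation from start to goal, updating the visited
    state plus at most k extra states per step (bounded propagation)."""
    s = start
    while transitions[s]:                       # stop when the goal is reached
        a = bellman_update(s)
        for extra in predecessors(s)[:k]:       # propagate to at most k states
            bellman_update(extra)
        r, acc = random.random(), 0.0           # sample the stochastic successor
        for p, ns in transitions[s][a]:
            acc += p
            if r <= acc:
                s = ns
                break

for _ in range(50):
    rtdp_k_trial("s0", k=2)
print({s: round(v, 2) for s, v in V.items()})

With k = 0 the loop degenerates to plain RTDP; larger k spends more work per step on off-trajectory updates, which is the trade-off the abstract attributes to faster convergence while keeping per-trial cost bounded.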