DocumentCode :
2239102
Title :
Pruning in UCT Algorithm
Author :
Huang, Jing ; Liu, Zhiqing ; Lu, Benjie ; Xiao, Feng
Author_Institution :
BUPT-JD Inst. of Comput. GO, Beijing Univ. of Posts & Telecommun., Beijing, China
fYear :
2010
fDate :
18-20 Nov. 2010
Firstpage :
177
Lastpage :
181
Abstract :
UCT is a Monte-Carlo planning algorithm that, with in a given amount of time, computes near-optimal solutions for Markovian decision processes of large state spaces. It has gained much attention from there search community and been used in many applications since its publication in 2006, because of its significant improvement of the effectiveness of Monte-Carlo planning computation. This paper proposes a modification of the UCT algorithm, which can prune certain Markovian decision process actions and their associated states during the Monte-Carlo planning computation. The pruning of actions and states is performed based on properties of underlying UCB algorithms of UCT. This paper proves that it is highly unlikely for the pruned actions and states to be in the solution path returned by the UCT algorithm, making the pruning modification almost just as good as the original algorithm. Additionally, the pruning modification may reduce the size of the Markovian decision process state space, and thus improves the effectiveness of the original algorithm. Experimental results in computer GO demonstrate the effectiveness of pruning in the UCT algorithm.
Keywords :
Markov processes; Monte Carlo methods; algorithm theory; decision theory; decision trees; planning (artificial intelligence); Markovian decision process state space; Monte-Carlo planning algorithm; UCT algorithm; near optimal solution; pruning; Monte-Carlo planning; UCT algorithm; pruning condition; territorial information;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Technologies and Applications of Artificial Intelligence (TAAI), 2010 International Conference on
Conference_Location :
Hsinchu City
Print_ISBN :
978-1-4244-8668-7
Electronic_ISBN :
978-0-7695-4253-9
Type :
conf
DOI :
10.1109/TAAI.2010.38
Filename :
5695450
Link To Document :
بازگشت