مرکز منطقه ای اطلاع رساني علوم و فناوري - Backpropagation Modification in Monte-Carlo Game Tree Search

DocumentCode :

2920534

Title :

Backpropagation Modification in Monte-Carlo Game Tree Search

Author :

Xie, Fan ; Liu, Zhiqing

Author_Institution :

Jiu-Ding Comput. Go Res. Inst., Beijing Univ. of Posts & Telecommun., Beijing, China

Volume :

fYear :

2009

fDate :

21-22 Nov. 2009

Firstpage :

125

Lastpage :

128

Abstract :

The Algorithm UCT, proposed by Kocsys et al, which apply multi-armed bandit problem into the tree-structured search space, achieves some remarkable success in some challenging fields. For UCT algorithm, Monte-Carlo simulations are performed with the guidance of UCB1 formula, which are averaged to evaluate a specified action. We observe that, as more simulations are performed, later ones usually lead to more accurate results, partly because the level of the search used in the later simulation is deeper and partly because more results are available to direct subsequent simulations. This paper presents a new method to improve the performance of UCT algorithm by increasing the feedback value of the later simulations. And the experimental results in the classical game Go show that our approach increases the performance of Monte-Carlo simulations significantly when exponential models are used.

Keywords :

Monte Carlo methods; backpropagation; games of skill; tree searching; Monte Carlo game tree search; backpropagation modification; classical game Go; exponential models; multi armed bandit problem; tree structured search space; Application software; Backpropagation algorithms; Computational modeling; Distributed computing; Feedback; Financial advantage program; Information technology; Machine learning; Performance evaluation; Telecommunication computing; machine learning; monte-carlo tree search; weight factor;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Information Technology Application, 2009. IITA 2009. Third International Symposium on

Conference_Location :

Nanchang

Print_ISBN :

978-0-7695-3859-4

Type :

conf

DOI :

10.1109/IITA.2009.331

Filename :

5369603

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2920534