DocumentCode
2238585
Title
Analysis of UCT algorithm policies in imperfect information game
Author
Jiajia Zhang ; Xuan Wang ; Ling Yang ; Jia Ji ; Dongsheng Zhi
Author_Institution
Shenzhen Grad. Sch., Intell. Comput. Res. Center, Harbin Inst. of Technol., Shenzhen, China
fYear
2012
fDate
Oct. 30 2012-Nov. 1 2012
Firstpage
132
Lastpage
137
Abstract
For the problem of mini-max tree search, Upper Confidence Bound (UCB) algorithm for multi-armed bandit problem has already been extended to algorithm UCT (UCB applied to Trees). It has shown advantages in the search tree with high branching factors and attained a great success in several domains such as Go program. In this paper, exploration and exploitation balance factor (EBF) is introduced as important parameter in UCT policies. Based on a known domain, which is called Siguo game, the performances for the different parameterized policies of UCT algorithm are compared and analysis is provided also. Following, some hypotheses about the cause of the problems are presented. Moreover, the suggested method about adoption and parameterization of UCT policies is provided for different type and characteristics of game problems.
Keywords
computer games; tree searching; EBF; Go program; Siguo game; UCB algorithm; UCB applied to trees; UCT algorithm policies; branching factors; exploitation balance factor; imperfect information game; mini-max tree search; multiarmed bandit problem; upper confidence bound; Algorithm design and analysis; Computers; Educational institutions; Game theory; Games; Monte Carlo methods; Search problems; Computer game; Exploration-exploitation; Imperfect information; Monte-Carlo sampling; UCT;
fLanguage
English
Publisher
ieee
Conference_Titel
Cloud Computing and Intelligent Systems (CCIS), 2012 IEEE 2nd International Conference on
Conference_Location
Hangzhou
Print_ISBN
978-1-4673-1855-6
Type
conf
DOI
10.1109/CCIS.2012.6664383
Filename
6664383
Link To Document