DocumentCode
3683574
Title
Convergence and correctness analysis of Monte-Carlo tree search algorithms: A case study of 2 by 4 Chinese dark chess
Author
Hung-Jui Chang;Chih-Wen Hsueh;Tsan-sheng Hsu
Author_Institution
Depart of Computer Science and Information Engineering, National Taiwan University, Taipei 106, Taiwan
fYear
2015
Firstpage
260
Lastpage
266
Abstract
The convergence and correctness of the UCT algorithm is a hot research problem. Previous research has shown the convergence of UCT-based algorithm on simultaneous turns or random turns games, but little is known for traditional alternating turns games. In this paper, we analyze the performance of the UCT algorithm in a 2-player imperfect information alternating turns game, 2 × 4 Chinese dark chess (CDC). The performance of the UCT algorithm is measured by the correctness rate and convergence speed. The correctness is defined by the percentage that the UCT algorithm outputs the same move with the one having the best game theoretic value. The convergence speed is defined by the entropy of a set of moves, which are output by the UCT algorithm using the same number of iterations. Experimental result shows the convergence of the UCT algorithm in the CDC, which can also be applied to explain the existence of diminishing returns in the UCT algorithm. Experimental result also shows an UCT algorithm does not always output moves with the best game theoretic value, but these with the highest chance of winning.
Keywords
"Games","Convergence","Computational modeling","Nickel","Probability","Monte Carlo methods","Entropy"
Publisher
ieee
Conference_Titel
Computational Intelligence and Games (CIG), 2015 IEEE Conference on
ISSN
2325-4270
Electronic_ISBN
2325-4289
Type
conf
DOI
10.1109/CIG.2015.7317963
Filename
7317963
Link To Document