Convergence and correctness analysis of Monte-Carlo tree search algorithms: A case study of 2 by 4 Chinese dark chess

Author

Hung-Jui Chang;Chih-Wen Hsueh;Tsan-sheng Hsu

Author_Institution

Depart of Computer Science and Information Engineering, National Taiwan University, Taipei 106, Taiwan

fYear

2015

Firstpage

260

Lastpage

266

Abstract

The convergence and correctness of the UCT algorithm is a hot research problem. Previous research has shown the convergence of UCT-based algorithm on simultaneous turns or random turns games, but little is known for traditional alternating turns games. In this paper, we analyze the performance of the UCT algorithm in a 2-player imperfect information alternating turns game, 2 × 4 Chinese dark chess (CDC). The performance of the UCT algorithm is measured by the correctness rate and convergence speed. The correctness is defined by the percentage that the UCT algorithm outputs the same move with the one having the best game theoretic value. The convergence speed is defined by the entropy of a set of moves, which are output by the UCT algorithm using the same number of iterations. Experimental result shows the convergence of the UCT algorithm in the CDC, which can also be applied to explain the existence of diminishing returns in the UCT algorithm. Experimental result also shows an UCT algorithm does not always output moves with the best game theoretic value, but these with the highest chance of winning.

Keywords

"Games","Convergence","Computational modeling","Nickel","Probability","Monte Carlo methods","Entropy"

Publisher

ieee

Conference_Titel

Computational Intelligence and Games (CIG), 2015 IEEE Conference on

ISSN

2325-4270

Electronic_ISBN

2325-4289

Type

conf

DOI

10.1109/CIG.2015.7317963

Filename

7317963