Title :
Monte Carlo Tree Search with heuristic evaluations using implicit minimax backups
Author :
Lanctot, Marc ; Winands, Mark H. M. ; Pepels, Tom ; Sturtevant, Nathan R.
Author_Institution :
Dept. of Knowledge Eng., Maastricht Univ., Maastricht, Netherlands
Abstract :
Monte Carlo Tree Search (MCTS) has improved the performance of game engines in domains such as Go, Hex, and general game playing. MCTS has been shown to outperform classic αβ search in games where good heuristic evaluations are difficult to obtain. In recent years, combining ideas from traditional minimax search in MCTS has been shown to be advantageous in some domains, such as Lines of Action, Amazons, and Breakthrough. In this paper, we propose a new way to use heuristic evaluations to guide the MCTS search by storing the two sources of information, estimated win rates and heuristic evaluations, separately. Rather than using the heuristic evaluations to replace the playouts, our technique backs them up implicitly during the MCTS simulations. These minimax values are then used to guide future simulations. We show that using implicit minimax backups leads to stronger play performance in Kalah, Breakthrough, and Lines of Action.
Keywords :
Monte Carlo methods; computer games; minimax techniques; tree searching; αβ search; Amazons; Breakthrough; Lines of Action; MCTS simulations; Monte Carlo tree search; game engine performance; game playing; heuristic evaluations; implicit minimax backups; minimax search; Games; Silicon;
Conference_Titel :
Computational Intelligence and Games (CIG), 2014 IEEE Conference on
Conference_Location :
Dortmund
DOI :
10.1109/CIG.2014.6932903