DocumentCode :
2386494
Title :
Reinforcement learning and the effects of parameter settings in the game of Chung Toi
Author :
Gatti, Christopher J. ; Embrechts, Mark J. ; Linton, Jonathan D.
Author_Institution :
Dept. of Ind. & Syst. Eng., Rensselaer Polytech. Inst., Troy, NY, USA
fYear :
2011
fDate :
9-12 Oct. 2011
Firstpage :
3530
Lastpage :
3535
Abstract :
This work applied reinforcement learning and the temporal difference TD(λ) algorithm to train a neural network to play the game of Chung Toi, a challenging variant of Tic-Tac-Toe. The effects of changing the parameters and settings of the TD(λ) algorithm and of the neural network were evaluated by observing the ability of the network to learn the game of Chung Toi and to play against a 'smart' random player. Techniques that have proven effective in training neural networks in general were also applied to the TD(λ) algorithm. The basic implementation of the TD(λ) method resulted in stable performance and achieved a maximal performance of winning 90.4% of evaluation games. Among the parameter settings that were varied, the best performance was achieved by using different learning rates for different layers of the neural network (92.6% wins), followed by using a relatively high probability of action exploitation (91.8% wins).
Keywords :
computer games; learning (artificial intelligence); neural nets; probability; Chung Toi game; Tic-Tac-Toe; action exploitation; evaluation game; learning rate; neural network; probability; reinforcement learning; temporal difference algorithm; Annealing; Games; Learning; Neural networks; Training; Transfer functions; Vectors; board game; neural network; reinforcement learning; temporal difference;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Systems, Man, and Cybernetics (SMC), 2011 IEEE International Conference on
Conference_Location :
Anchorage, AK
ISSN :
1062-922X
Print_ISBN :
978-1-4577-0652-3
Type :
conf
DOI :
10.1109/ICSMC.2011.6084216
Filename :
6084216