Regional Information Center for Science and Technology (RICeST) - Reinforcement learning and the effects of parameter settings in the game of Chung Toi

DocumentCode :

2386494

Title :

Reinforcement learning and the effects of parameter settings in the game of Chung Toi

Author :

Gatti, Christopher J. ; Embrechts, Mark J. ; Linton, Jonathan D.

Author_Institution :

Dept. of Ind. & Syst. Eng., Rensselaer Polytech. Inst., Troy, NY, USA

Year :

2011

Date :

9-12 Oct. 2011

Firstpage :

3530

Lastpage :

3535

Abstract :

This work applied reinforcement learning and the temporal difference TD(λ) algorithm to train a neural network to play the game of Chung Toi, a challenging variant of Tic-Tac-Toe. The effects of changing parameters and settings of the TD(λ) algorithm and of the neural network were evaluated by observing the ability of the network to learn the game of Chung Toi and play against a 'smart' random player. This work applied to the TD(λ) algorithm techniques that have proven effective in training neural networks in general. The basic implementation of the TD(λ) method resulted in stable performance and achieved a maximal performance of winning 90.4% of evaluation games. When changing parameter settings, the best performance was achieved by using different learning rates between layers in the neural network (92.6% wins), followed by using a relatively high probability of action exploitation (91.8% wins).
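
The abstract names three ingredients (TD(λ) updates, separate learning rates per network layer, and a probability of action exploitation) without giving implementation details. The following is a minimal sketch of how they can fit together, not the authors' implementation. It assumes a single tanh hidden layer, a sigmoid output read as a win estimate, and a reward given only at the end of a game; the function names, parameter names, and default values (`alpha_hidden`, `alpha_out`, `lam`, `exploit_prob`) are hypothetical and chosen for illustration.

```python
import math
import random


def forward(w_hidden, w_out, x):
    """Return (hidden activations, value estimate in (0, 1)) for input x."""
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in w_hidden]
    value = 1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(w_out, hidden))))
    return hidden, value


def gradients(w_out, x, hidden, value):
    """Gradient of the value estimate with respect to every weight."""
    d_out = value * (1.0 - value)  # derivative of the sigmoid output
    g_out = [d_out * h for h in hidden]
    g_hidden = [[d_out * w_out[j] * (1.0 - hidden[j] ** 2) * xi for xi in x]
                for j in range(len(hidden))]
    return g_hidden, g_out


def td_lambda_episode(w_hidden, w_out, states, reward,
                      alpha_hidden=0.05, alpha_out=0.01, lam=0.7, gamma=1.0):
    """Update the weights in place from one game's sequence of state vectors.

    reward is the terminal outcome (assumed here: 1.0 for a win, 0.0 for a
    loss). alpha_hidden and alpha_out are the per-layer learning rates.
    """
    # Eligibility traces, one per weight, reset at the start of each game.
    e_hidden = [[0.0] * len(row) for row in w_hidden]
    e_out = [0.0] * len(w_out)
    for t, x in enumerate(states):
        hidden, value = forward(w_hidden, w_out, x)
        g_hidden, g_out = gradients(w_out, x, hidden, value)
        # Decay the old traces and add the current gradient.
        e_out = [gamma * lam * e + g for e, g in zip(e_out, g_out)]
        e_hidden = [[gamma * lam * e + g for e, g in zip(e_row, g_row)]
                    for e_row, g_row in zip(e_hidden, g_hidden)]
        # TD target: the next state's value, or the reward at the end.
        if t + 1 < len(states):
            _, next_value = forward(w_hidden, w_out, states[t + 1])
            target = gamma * next_value
        else:
            target = reward
        delta = target - value  # temporal difference error
        for j in range(len(w_out)):
            w_out[j] += alpha_out * delta * e_out[j]
            for i in range(len(x)):
                w_hidden[j][i] += alpha_hidden * delta * e_hidden[j][i]


def choose_move(values, exploit_prob=0.9, rng=random):
    """Pick the best-valued move with probability exploit_prob, else a random one."""
    if rng.random() < exploit_prob:
        return max(range(len(values)), key=values.__getitem__)
    return rng.randrange(len(values))
```

In this sketch, `values` would hold the network's estimates for the positions reachable from the current board, and `states` would hold the encoded positions visited during one game. How Chung Toi positions are encoded as input vectors is not stated in the abstract and is left open here.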

Keywords :

computer games; learning (artificial intelligence); neural nets; probability; Chung Toi game; Tic-Tac-Toe; action exploitation; evaluation game; learning rate; neural network; reinforcement learning; temporal difference algorithm; Annealing; Games; Learning; Neural networks; Training; Transfer functions; Vectors; board game; temporal difference

Language :

English

Publisher :

IEEE

Conference_Title :

Systems, Man, and Cybernetics (SMC), 2011 IEEE International Conference on

Conference_Location :

Anchorage, AK

ISSN :

1062-922X

Print_ISBN :

978-1-4577-0652-3

Type :

conf

DOI :

10.1109/ICSMC.2011.6084216

Filename :

6084216

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2386494