DocumentCode :
349970
Title :
Online EM algorithm for acquiring evaluation function of game Othello through reinforcement learning
Author :
Yoshioka, Taku ; Ishii, Shin
Author_Institution :
Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Ikoma, Japan
Volume :
5
fYear :
1999
fDate :
1999
Firstpage :
498
Abstract :
We previously proposed a method (1998) for acquiring a good evaluation function of the game Othello based on min-max reinforcement learning. In the previous work, we employed a gradient descent method for training the normalized Gaussian network (NGnet) that represents the Othello´s evaluation function. However, a lot of games were required to train the NGnet. In this study, we employ the previously proposed online EM algorithm to train the NGnet. The online EM algorithm converges faster than the gradient descent method, and it is suitable for dynamic environments, such as in reinforcement learning tasks. In this article, we introduce a new architecture that is composed of a certain number of NGnets to represent the evaluation function. In this architecture, each of the NGnets is independently trained by the online EM algorithm. Our experiments show that a good evaluation function can be obtained by this new architecture through a smaller number of training games than by the previous scheme
Keywords :
computer games; function approximation; learning (artificial intelligence); neural nets; real-time systems; NGnets; computer games; function approximation; game Othello; normalized Gaussian network; online EM algorithm; reinforcement learning; Approximation algorithms; Computer architecture; Function approximation; Humans; Information science; Laboratories; Neural networks; State-space methods; Supervised learning; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on
Conference_Location :
Tokyo
ISSN :
1062-922X
Print_ISBN :
0-7803-5731-0
Type :
conf
DOI :
10.1109/ICSMC.1999.815602
Filename :
815602
Link To Document :
بازگشت