مرکز منطقه ای اطلاع رساني علوم و فناوري - TD methods applied to mixture of experts for learning 9×9 Go evaluation function

DocumentCode :

1802617

Title :

TD methods applied to mixture of experts for learning 9×9 Go evaluation function

Author :

Zaman, Raonak ; Wunsch, Donald C., II

Author_Institution :

Appl. Comput. Intelligence Lab., Texas Tech. Univ., Lubbock, TX, USA

Volume :

fYear :

1999

fDate :

36342

Firstpage :

3734

Abstract :

The temporal difference (TD) method is applied on a committee of neural network experts to learn the board evaluation function for the oriental board game Go. The game has simple rules but requires complex strategies to play well, and the conventional tree search algorithm for computer games makes a poor Go program. Thus, the game Go is an ideal problem domain for exploring machine learning algorithms. Here, the neural networks learned a board evaluation function for Go played on 9×9 board sizes. Two learning algorithms, e.g., hybrid mixture of experts (HME) and Meta-Pi, are used to train the neural network experts. Both algorithms learned good Go evaluation functions and the neural network based Go engines were able to defeat a public domain rule-based program more than 50% of the times. The performances of the mixture networks are compared with that of a single feedforward network trained similarly

Keywords :

computer games; games of skill; learning (artificial intelligence); neural nets; 9×9 Go evaluation function learning; HME; Meta-Pi; TD methods; board evaluation function; board game Go; feedforward network; hybrid expert mixture; machine learning algorithms; neural network expert committee; temporal difference method; tree search algorithm; Books; Computational intelligence; Computer networks; Engines; Laboratories; Law; Legal factors; Machine learning algorithms; Neural networks; Predictive models;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks, 1999. IJCNN '99. International Joint Conference on

Conference_Location :

Washington, DC

ISSN :

1098-7576

Print_ISBN :

0-7803-5529-6

Type :

conf

DOI :

10.1109/IJCNN.1999.830746

Filename :

830746

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1802617