Title :
Board Representations for Neural Go Players Learning by Temporal Difference
Author :
Mayer, Helmut A.
Author_Institution :
Dept. of Comput. Sci., Salzburg Univ.
Abstract :
The majority of work on artificial neural networks (ANNs) playing the game of Go focus on network architectures and training regimes to improve the quality of the neural player. A less investigated problem is the board representation conveying the information on the current state of the game to the network. Common approaches suggest a straight-forward encoding by assigning each point on the board to a single (or more) input neurons. However, these basic representations do not capture elementary structural relationships between stones (and points) being essential to the game. We compare three different board representations for self-learning ANNs on a 5 times 5 board employing temporal difference learning (TDL) with two types of move selection (during training). The strength of the trained networks is evaluated in games against three computer players of different quality. A tournament of the best neural players, addition of alpha-beta search, and a commented game of a neural player against the best computer player further explore the potential of the neural players and its respective board representations
Keywords :
computer games; neural nets; temporal reasoning; unsupervised learning; artificial neural network; board representations; game playing; network architecture; neural Go player; self-learning ANN; temporal difference learning; Artificial neural networks; Computational intelligence; Computer architecture; Computer networks; Encoding; Game theory; Humans; Intelligent networks; Neural networks; Neurons; Artificial Neural Networks; Board Representation; Game of Go; Temporal Difference Learning;
Conference_Titel :
Computational Intelligence and Games, 2007. CIG 2007. IEEE Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0709-5
DOI :
10.1109/CIG.2007.368096