Regional Information Center for Science and Technology - Temporal Difference Learning Versus Co-Evolution for Acquiring Othello Position Evaluation

DocumentCode :

3497537

Title :

Temporal Difference Learning Versus Co-Evolution for Acquiring Othello Position Evaluation

Author :

Lucas, Simon M. ; Runarsson, Thomas P.

Author_Institution :

Dept. of Comput. Sci., Essex Univ., Colchester

fYear :

2006

fDate :

22-24 May 2006

Firstpage :

Lastpage :

Abstract :

This paper compares the use of temporal difference learning (TDL) versus co-evolutionary learning (CEL) for acquiring position evaluation functions for the game of Othello. The paper provides important insights into the strengths and weaknesses of each approach. The main findings are that for Othello, TDL learns much faster than CEL, but that properly tuned CEL can learn better playing strategies. For CEL, it is essential to use parent-child weighted averaging in order to achieve good performance. Using this method, a high-quality weighted piece counter was evolved and was shown to significantly outperform a set of standard heuristic weights.
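
The two terms the abstract relies on, a weighted piece counter and parent-child weighted averaging, can be illustrated with a minimal sketch. It is an illustration under stated assumptions and not the authors' implementation: the board encoding (+1, -1, 0), the function names `wpc_value` and `blend_parent_child`, and the blending factor `beta` are all hypothetical, since the abstract does not specify them.

```python
from typing import List

# Assumed encoding (not from the record): 8x8 board,
# +1 = black disc, -1 = white disc, 0 = empty square.
Board = List[List[int]]
Weights = List[List[float]]  # one weight per square


def wpc_value(weights: Weights, board: Board) -> float:
    """Weighted piece counter: sum over squares of weight * square contents."""
    return sum(
        weights[r][c] * board[r][c]
        for r in range(8)
        for c in range(8)
    )


def blend_parent_child(parent: Weights, child: Weights, beta: float) -> Weights:
    """Weighted average: move the parent a fraction beta toward the child.

    beta is a hypothetical parameter; beta = 1.0 would replace the parent
    outright, and smaller values make each step more conservative.
    """
    return [
        [p + beta * (c - p) for p, c in zip(parent_row, child_row)]
        for parent_row, child_row in zip(parent, child)
    ]


# Standard Othello opening position: two discs of each colour in the centre.
start: Board = [[0] * 8 for _ in range(8)]
start[3][3], start[4][4] = -1, -1
start[3][4], start[4][3] = 1, 1

uniform: Weights = [[1.0] * 8 for _ in range(8)]
zeros: Weights = [[0.0] * 8 for _ in range(8)]

opening_value = wpc_value(uniform, start)
blended = blend_parent_child(zeros, uniform, beta=0.25)
```

With uniform weights the evaluation reduces to a plain disc difference, so the symmetric opening position scores zero. In a co-evolutionary loop, a blend like this would be applied when a mutated child outperforms its parent, rather than copying the child over the parent.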

Keywords :

computer games; evolutionary computation; games of skill; learning (artificial intelligence); Othello position evaluation; coevolutionary learning; parent-child weighted averaging; temporal difference learning; weighted piece counter evolution; Computer science; Counting circuits; Explosions; Law; Legal factors; Minimax techniques; Reflection; Othello; co-evolution; temporal difference learning;

fLanguage :

English

Publisher :

IEEE

Conference_Titel :

Computational Intelligence and Games, 2006 IEEE Symposium on

Conference_Location :

Reno, NV

Print_ISBN :

1-4244-0464-9

Type :

conf

DOI :

10.1109/CIG.2006.311681

Filename :

4100108

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3497537