مرکز منطقه ای اطلاع رساني علوم و فناوري - Investigating learning rates for evolution and temporal difference learning

DocumentCode :

1840155

Title :

Investigating learning rates for evolution and temporal difference learning

Author :

Lucas, Simon M.

Author_Institution :

Dept. of Comput. & Electron. Syst., Univ. of Essex, Colchester

fYear :

2008

fDate :

15-18 Dec. 2008

Firstpage :

Lastpage :

Abstract :

Evidently, any learning algorithm can only learn on the basis of the information given to it. This paper presents a first attempt to place an upper bound on the information rates attainable with standard co-evolution and with TDL. The upper bound for TDL is shown to be much higher than for co-evolution. Under commonly used settings for learning to play Othello for example, TDL may have an upper bound that is hundreds or even thousands of times higher than that of coevolution. To test how well these bounds correlate with actual learning rates, a simple two-player game called Treasure Hunt is developed. While the upper bounds cannot be used to predict the number of games required to learn the optimal policy, they do correctly predict the rank order of the number of games required by each algorithm.

Keywords :

computer games; learning (artificial intelligence); Othello; Treasure Hunt; co-evolution; learning rates; temporal difference learning; two-player game; Counting circuits; Evolutionary computation; Information rates; Machine learning; Machine learning algorithms; Neural networks; Robustness; Surges; Testing; Upper bound;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computational Intelligence and Games, 2008. CIG '08. IEEE Symposium On

Conference_Location :

Perth, WA

Print_ISBN :

978-1-4244-2973-8

Electronic_ISBN :

978-1-4244-2974-5

Type :

conf

DOI :

10.1109/CIG.2008.5035614

Filename :

5035614

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1840155