DocumentCode :
3250170
Title :
Why co-evolution beats temporal difference learning at Backgammon for a linear architecture, but not a non-linear architecture
Author :
Darwen, Paul J.
Author_Institution :
Dept. of Comput. Sci. & Electr. Eng., Queensland Univ., Brisbane, Qld., Australia
Volume :
2
fYear :
2001
fDate :
2001
Firstpage :
1003
Abstract :
No Free Lunch theorems show that the algorithm must suit the problem. This does not answer the novice´s question: for a given problem, which algorithm to use? This paper compares co-evolutionary learning and temporal difference learning on the game of Backgammon, which (like many real-world tasks) has an element of random uncertainty. Unfortunately, to fully evaluate a single strategy using undirected sampling of board positions, using only random dice rolls, requires a great deal of computation. Evolution´s all-or-nothing replacement of entire solutions needs accurate evaluation, but relatively rare board positions are needed to train above a certain level. Temporal difference learning, with its incremental changes, does not use such an all-or-nothing approach. These results have relevance to a variety of real-world tasks with uncertainty, such as schedule optimization
Keywords :
computer games; evolutionary computation; games of skill; learning (artificial intelligence); Backgammon; Free Lunch theorems; all-or-nothing approach; co-evolutionary learning; game; linear architecture; nonlinear architecture; random uncertainty; schedule optimization; temporal difference learning; Cognitive science; Computer architecture; Computer science; Law; Legal factors; Neural networks; Optimal scheduling; Sampling methods; Scheduling algorithm; Uncertainty;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Evolutionary Computation, 2001. Proceedings of the 2001 Congress on
Conference_Location :
Seoul
Print_ISBN :
0-7803-6657-3
Type :
conf
DOI :
10.1109/CEC.2001.934300
Filename :
934300
Link To Document :
بازگشت