مرکز منطقه ای اطلاع رساني علوم و فناوري - Why co-evolution beats temporal difference learning at Backgammon for a linear architecture, but not a non-linear architecture

DocumentCode :

3250170

Title :

Why co-evolution beats temporal difference learning at Backgammon for a linear architecture, but not a non-linear architecture

Author :

Darwen, Paul J.

Author_Institution :

Dept. of Comput. Sci. & Electr. Eng., Queensland Univ., Brisbane, Qld., Australia

Volume :

fYear :

2001

fDate :

2001

Firstpage :

1003

Abstract :

No Free Lunch theorems show that the algorithm must suit the problem. This does not answer the novice´s question: for a given problem, which algorithm to use? This paper compares co-evolutionary learning and temporal difference learning on the game of Backgammon, which (like many real-world tasks) has an element of random uncertainty. Unfortunately, to fully evaluate a single strategy using undirected sampling of board positions, using only random dice rolls, requires a great deal of computation. Evolution´s all-or-nothing replacement of entire solutions needs accurate evaluation, but relatively rare board positions are needed to train above a certain level. Temporal difference learning, with its incremental changes, does not use such an all-or-nothing approach. These results have relevance to a variety of real-world tasks with uncertainty, such as schedule optimization

Keywords :

computer games; evolutionary computation; games of skill; learning (artificial intelligence); Backgammon; Free Lunch theorems; all-or-nothing approach; co-evolutionary learning; game; linear architecture; nonlinear architecture; random uncertainty; schedule optimization; temporal difference learning; Cognitive science; Computer architecture; Computer science; Law; Legal factors; Neural networks; Optimal scheduling; Sampling methods; Scheduling algorithm; Uncertainty;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Evolutionary Computation, 2001. Proceedings of the 2001 Congress on

Conference_Location :

Seoul

Print_ISBN :

0-7803-6657-3

Type :

conf

DOI :

10.1109/CEC.2001.934300

Filename :

934300

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3250170