مرکز منطقه ای اطلاع رساني علوم و فناوري - XCS with computed prediction in continuous multistep environments

DocumentCode :

2730723

Title :

XCS with computed prediction in continuous multistep environments

Author :

Lanzi, Pier Luca ; Loiacono, Daniele ; Wilson, Stewart W. ; Goldberg, David E.

Author_Institution :

Dip. di Elettronica e Informazione, Politecnico di Milano, Italy

Volume :

fYear :

2005

fDate :

2-5 Sept. 2005

Firstpage :

2032

Abstract :

We apply XCS with computed prediction (XCSF) to tackle multistep reinforcement learning problems involving continuous inputs. In essence we use XCSF as a method of generalized reinforcement learning. We show that in domains involving continuous inputs and delayed rewards XCSF can evolve compact populations of accurate maximally general classifiers which represent the optimal solution to the target problem. We compare the performance of XCSF with that of tabular Q-learning adapted to the continuous domains considered here. The results we present show that XCSF can converge much faster than tabular techniques while producing more compact solutions. Our results also suggest that when exploration is less effective in some areas of the problem space, XCSF can exploit effective generalizations to extend the evolved knowledge beyond the frequently explored areas. In contrast, in the same situations, the convergence speed of tabular Q-learning worsens.

Keywords :

convergence; evolutionary computation; generalisation (artificial intelligence); learning (artificial intelligence); optimisation; pattern classification; XCSF; computed prediction; generalizations; multistep reinforcement learning problems; tabular Q-learning; Boolean functions; Delay; Genetic algorithms; Genetic engineering; Laboratories; Learning; Piecewise linear approximation; Piecewise linear techniques; Polynomials; System testing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Evolutionary Computation, 2005. The 2005 IEEE Congress on

Print_ISBN :

0-7803-9363-5

Type :

conf

DOI :

10.1109/CEC.2005.1554945

Filename :

1554945

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2730723