DocumentCode :
1181051
Title :
Gradient descent methods in learning classifier systems: improving XCS performance in multistep problems
Author :
Butz, Martin V. ; Goldberg, David E. ; Lanzi, Pier Luca
Author_Institution :
Dept. of Gen. Eng., Univ. of Illinois, Urbana, IL, USA
Volume :
9
Issue :
5
fYear :
2005
Firstpage :
452
Lastpage :
473
Abstract :
The accuracy-based XCS classifier system has been shown to solve typical data mining problems in a machine-learning competitive way. However, successful applications in multistep problems, modeled by a Markov decision process, were restricted to very small problems. Until now, the temporal difference learning technique in XCS was based on deterministic updates. However, since a prediction is actually generated by a set of rules in XCS and Learning Classifier Systems in general, gradient-based update methods are applicable. The extension of XCS to gradient-based update methods results in a classifier system that is more robust and more parameter independent, solving large and difficult maze problems reliably. Additionally, the extension to gradient methods highlights the relation of XCS to other function approximation methods in reinforcement learning.
Keywords :
Markov processes; data mining; evolutionary computation; function approximation; gradient methods; learning (artificial intelligence); Markov decision process; XCS performance; data mining; evolutionary computation; function approximation method; gradient descent method; learning classifier system; machine learning; multistep problem; reinforcement learning; Data mining; Evolutionary computation; Function approximation; Genetic algorithms; Genetic engineering; Government; Laboratories; Learning systems; Robustness; Zero current switching; Function approximation; Q-learning; XCS; gradient descent; learning classifier systems (LCSs); multistop problems; reinforcement learning;
fLanguage :
English
Journal_Title :
Evolutionary Computation, IEEE Transactions on
Publisher :
ieee
ISSN :
1089-778X
Type :
jour
DOI :
10.1109/TEVC.2005.850265
Filename :
1514471
Link To Document :
بازگشت