Title :
Is XCS Suitable For Problems with Temporal Rewards?
Author :
Tang, Kai Wing ; Jarvis, Ray A.
Author_Institution :
Dept. of Electr. & Comput. Syst. Eng., Monash Univ., Clayton, Vic.
Abstract :
XCS, the accuracy-based classifier system, provides a very brilliant way to merge genetic algorithmic (GA) rule learning and reinforcement learning (RL) methodologies together. This makes it suitable for a wide range of applications where generalisation over decision making states is desirable. Also, its Q-learning-oriented prediction update scheme enables it to handle multi-step problems adequately. This paper reports how the intertwined spirals problem, initially a popular benchmark in classification, was modified by the authors to verify XCS´s suitability for behavioural design of robotic systems. When the results obtained were not as expected, investigations were continued until a rather surprising conclusion was drawn: XCS cannot handle very simple problems if the rewards are temporally-oriented, even if the reward is extremely short-delayed
Keywords :
decision making; generalisation (artificial intelligence); genetic algorithms; learning (artificial intelligence); multi-robot systems; pattern classification; Q-learning-oriented prediction update scheme; accuracy-based classifier system; decision making; genetic algorithmic rule learning; intertwined spirals problem; multistep problem; reinforcement learning; robotic system behavioural design; Application software; Decision making; Genetic algorithms; Genetic engineering; Intelligent robots; Machine learning; Solids; Spirals; Systems engineering and theory; Testing;
Conference_Titel :
Computational Intelligence for Modelling, Control and Automation, 2005 and International Conference on Intelligent Agents, Web Technologies and Internet Commerce, International Conference on
Conference_Location :
Vienna
Print_ISBN :
0-7695-2504-0
DOI :
10.1109/CIMCA.2005.1631478