مرکز منطقه ای اطلاع رساني علوم و فناوري - Improving reinforcement learning algorithms by the use of data mining techniques for feature and action selection

DocumentCode :

3186477

Title :

Improving reinforcement learning algorithms by the use of data mining techniques for feature and action selection

Author :

de L Vieira, Davi C ; Adeodato, Paulo J L ; Gonçalves, Paulo M., Jr.

Author_Institution :

Centro de Inf., Univ. Fed. de Pernambuco, Recife, Brazil

fYear :

2010

fDate :

10-13 Oct. 2010

Firstpage :

1863

Lastpage :

1870

Abstract :

Data mining can be seen as an area of artificial intelligence that seeks to extract information or patterns from large amounts of data either stored in databases or flowing in streams. The main contribution of this work is to present how LVF data mining technique improves Sarsa(λ) algorithm combined with tile-coding technique by selecting the most relevant features and actions from reinforcement learning environments. The objective of this selection is to reduce the complexity of the problem and the amount of memory used by the agent thus leading to faster convergence. The motivation of this work was inspired by the rationale behind Occam´s razor, which describes that a complex model tends to be less accurate than another with a lower complexity. The difficulty in using data mining techniques in reinforcement learning environments is due to the lack of data in a database, so this paper proposes a storage schema for states visited and actions performed by the agent. In this study, the selection of features and actions are applied to a specific problem of RoboCup soccer, the dribble. This problem is composed of 20 continuous variables and 113 actions available to the agent which results in a memory consumption of approximately 4.5mb when the traditional Sarsa(λ) algorithm is used combined with the tile-coding technique. The experiments´ results show that the amount of variables in the environment were reduced by 35% and the amount of actions by 65%, which resulted in a reduction in memory consumption of 43% and an increase in performance of up to 23%, according to the relative frequency distribution of agent´s success. The approach proposed here is both easy to use and efficient.

Keywords :

data mining; learning (artificial intelligence); mobile robots; multi-agent systems; multi-robot systems; software agents; RoboCup soccer problem; Sarsa algorithm; action selection; artificial intelligence; data mining; feature and action selection; feature selection; reinforcement learning algorithm; tile coding technique; Lead; Training; Variable speed drives; Data Mining; Feature and Action Selection; Intelligent Agents; Reinforcement Learning; RoboCup;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Systems Man and Cybernetics (SMC), 2010 IEEE International Conference on

Conference_Location :

Istanbul

ISSN :

1062-922X

Print_ISBN :

978-1-4244-6586-6

Type :

conf

DOI :

10.1109/ICSMC.2010.5642280

Filename :

5642280

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3186477