• DocumentCode
    3186477
  • Title

    Improving reinforcement learning algorithms by the use of data mining techniques for feature and action selection

  • Author

    de L Vieira, Davi C ; Adeodato, Paulo J L ; Gonçalves, Paulo M., Jr.

  • Author_Institution
    Centro de Inf., Univ. Fed. de Pernambuco, Recife, Brazil
  • fYear
    2010
  • fDate
    10-13 Oct. 2010
  • Firstpage
    1863
  • Lastpage
    1870
  • Abstract
    Data mining can be seen as an area of artificial intelligence that seeks to extract information or patterns from large amounts of data either stored in databases or flowing in streams. The main contribution of this work is to present how LVF data mining technique improves Sarsa(λ) algorithm combined with tile-coding technique by selecting the most relevant features and actions from reinforcement learning environments. The objective of this selection is to reduce the complexity of the problem and the amount of memory used by the agent thus leading to faster convergence. The motivation of this work was inspired by the rationale behind Occam´s razor, which describes that a complex model tends to be less accurate than another with a lower complexity. The difficulty in using data mining techniques in reinforcement learning environments is due to the lack of data in a database, so this paper proposes a storage schema for states visited and actions performed by the agent. In this study, the selection of features and actions are applied to a specific problem of RoboCup soccer, the dribble. This problem is composed of 20 continuous variables and 113 actions available to the agent which results in a memory consumption of approximately 4.5mb when the traditional Sarsa(λ) algorithm is used combined with the tile-coding technique. The experiments´ results show that the amount of variables in the environment were reduced by 35% and the amount of actions by 65%, which resulted in a reduction in memory consumption of 43% and an increase in performance of up to 23%, according to the relative frequency distribution of agent´s success. The approach proposed here is both easy to use and efficient.
  • Keywords
    data mining; learning (artificial intelligence); mobile robots; multi-agent systems; multi-robot systems; software agents; RoboCup soccer problem; Sarsa algorithm; action selection; artificial intelligence; data mining; feature and action selection; feature selection; reinforcement learning algorithm; tile coding technique; Lead; Training; Variable speed drives; Data Mining; Feature and Action Selection; Intelligent Agents; Reinforcement Learning; RoboCup;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems Man and Cybernetics (SMC), 2010 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1062-922X
  • Print_ISBN
    978-1-4244-6586-6
  • Type

    conf

  • DOI
    10.1109/ICSMC.2010.5642280
  • Filename
    5642280