• DocumentCode
    2053589
  • Title

    Increasing the flexibility and speed of convergence of a learning agent

  • Author

    Santibanez, Miguel A Soto ; Marefat, Michael M.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Arizona Univ., Tucson, AZ, USA
  • Volume
    3
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    1748
  • Abstract
    A review of the basic methods used to model a learning agent, such as instance-based learning, artificial neural networks and reinforcement learning, suggests that they either lack flexibility (can only be used to solve a small number of problems) or they tend to converge very slowly to the optimal policy. This paper describes and illustrates a set of processes that address these two shortcomings. The resulting learning agent is able to "adapt fairly well" to a much larger set of environments and is capable of doing this in a reasonable amount of time. In order to address the lack of flexibility and slow convergence to the optimal policy, the new learning agent becomes a hybrid between a learning agent based on instance-based learning and one based on reinforcement learning. To accelerate its convergence to its optimal policy, this new learning agent incorporates the use of a new concept we call propagation of good findings. Furthermore, to make a better use of the learning agent\´s memory resources,, and therefore increase its flexibility, we make use of another new concept we call moving prototypes
  • Keywords
    learning (artificial intelligence); neural nets; software agents; artificial neural networks; instance-based learning; learning agent; moving prototypes; reinforcement learning; slow convergence; speed of convergence; Accelerated aging; Artificial neural networks; Computer networks; Convergence; Learning; Leg; Neural networks; Prototypes; Robustness; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics, 2001 IEEE International Conference on
  • Conference_Location
    Tucson, AZ
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-7087-2
  • Type

    conf

  • DOI
    10.1109/ICSMC.2001.973538
  • Filename
    973538