• DocumentCode
    406110
  • Title

    A neural network solution for exploitation and exploration problems

  • Author

    Hwang, Kao-Shing ; Chiou, Jeng-Yih ; Wu, Cheng-Shong

  • Author_Institution
    Dept. of Electr. & Eng., Nat. Chung Chen Univ., Ming-Hsiung, Taiwan
  • Volume
    1
  • fYear
    2003
  • fDate
    14-17 Dec. 2003
  • Firstpage
    54
  • Abstract
    A reinforcement learning (RL) architecture using artificial neural networks (ANN) is proposed. The proposed method applies not only the exploitation mechanism but also the exploration mechanism to a partially unstructured environment and generates actions with learning capability. The exploitation capability is achieved by a normal distribution function with mean denoting a best action in the current state. Variances of the distribution function modified by a three-layer ANN indicate the range of exploring. The weights of this ANN are modified by an evaluated internal reinforcement signal so as to accumulating learning experiences. Meanwhile, the other two of ANNs approximate policy selection functions and estimate the state evaluation, respectively. They learn from by a reinforcement learning method. The performance of controlling a pendulum system is compared to one of the adaptive heuristic critic (AHC). The results show that the performance of proposed method is better than the AHC´s.
  • Keywords
    heuristic programming; learning (artificial intelligence); neural nets; normal distribution; adaptive heuristic critic; artificial neural networks; exploitation capability; exploration mechanism; normal distribution function; reinforcement learning architecture; Adaptive control; Artificial neural networks; Control systems; Convergence; Distribution functions; Gaussian distribution; Neural networks; Programmable control; State estimation; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks and Signal Processing, 2003. Proceedings of the 2003 International Conference on
  • Conference_Location
    Nanjing
  • Print_ISBN
    0-7803-7702-8
  • Type

    conf

  • DOI
    10.1109/ICNNSP.2003.1279211
  • Filename
    1279211