Title :
A neural network solution for exploitation and exploration problems
Author :
Hwang, Kao-Shing ; Chiou, Jeng-Yih ; Wu, Cheng-Shong
Author_Institution :
Dept. of Electr. & Eng., Nat. Chung Chen Univ., Ming-Hsiung, Taiwan
Abstract :
A reinforcement learning (RL) architecture using artificial neural networks (ANN) is proposed. The proposed method applies not only the exploitation mechanism but also the exploration mechanism to a partially unstructured environment and generates actions with learning capability. The exploitation capability is achieved by a normal distribution function with mean denoting a best action in the current state. Variances of the distribution function modified by a three-layer ANN indicate the range of exploring. The weights of this ANN are modified by an evaluated internal reinforcement signal so as to accumulating learning experiences. Meanwhile, the other two of ANNs approximate policy selection functions and estimate the state evaluation, respectively. They learn from by a reinforcement learning method. The performance of controlling a pendulum system is compared to one of the adaptive heuristic critic (AHC). The results show that the performance of proposed method is better than the AHC´s.
Keywords :
heuristic programming; learning (artificial intelligence); neural nets; normal distribution; adaptive heuristic critic; artificial neural networks; exploitation capability; exploration mechanism; normal distribution function; reinforcement learning architecture; Adaptive control; Artificial neural networks; Control systems; Convergence; Distribution functions; Gaussian distribution; Neural networks; Programmable control; State estimation; Working environment noise;
Conference_Titel :
Neural Networks and Signal Processing, 2003. Proceedings of the 2003 International Conference on
Conference_Location :
Nanjing
Print_ISBN :
0-7803-7702-8
DOI :
10.1109/ICNNSP.2003.1279211