مرکز منطقه ای اطلاع رساني علوم و فناوري - A neural network solution for exploitation and exploration problems

DocumentCode :

406110

Title :

A neural network solution for exploitation and exploration problems

Author :

Hwang, Kao-Shing ; Chiou, Jeng-Yih ; Wu, Cheng-Shong

Author_Institution :

Dept. of Electr. & Eng., Nat. Chung Chen Univ., Ming-Hsiung, Taiwan

Volume :

fYear :

2003

fDate :

14-17 Dec. 2003

Firstpage :

Abstract :

A reinforcement learning (RL) architecture using artificial neural networks (ANN) is proposed. The proposed method applies not only the exploitation mechanism but also the exploration mechanism to a partially unstructured environment and generates actions with learning capability. The exploitation capability is achieved by a normal distribution function with mean denoting a best action in the current state. Variances of the distribution function modified by a three-layer ANN indicate the range of exploring. The weights of this ANN are modified by an evaluated internal reinforcement signal so as to accumulating learning experiences. Meanwhile, the other two of ANNs approximate policy selection functions and estimate the state evaluation, respectively. They learn from by a reinforcement learning method. The performance of controlling a pendulum system is compared to one of the adaptive heuristic critic (AHC). The results show that the performance of proposed method is better than the AHC´s.

Keywords :

heuristic programming; learning (artificial intelligence); neural nets; normal distribution; adaptive heuristic critic; artificial neural networks; exploitation capability; exploration mechanism; normal distribution function; reinforcement learning architecture; Adaptive control; Artificial neural networks; Control systems; Convergence; Distribution functions; Gaussian distribution; Neural networks; Programmable control; State estimation; Working environment noise;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks and Signal Processing, 2003. Proceedings of the 2003 International Conference on

Conference_Location :

Nanjing

Print_ISBN :

0-7803-7702-8

Type :

conf

DOI :

10.1109/ICNNSP.2003.1279211

Filename :

1279211

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=406110