DocumentCode :
440191
Title :
Hardware implementation of FAST-based reinforcement learning algorithm
Author :
Hwang, Kao-Shing ; Hsu, Yuan-Pao ; Hsieh, His-Wen ; Lin, Hsin-Yi
Author_Institution :
Dept. of Electr., Nat. Chung-Cheng Univ., Taiwan
fYear :
2005
fDate :
28-30 May 2005
Firstpage :
435
Lastpage :
438
Abstract :
A FAST-based (flexible adaptable-size topology) reinforcement learning chip is implemented in this article. Basically, the FAST is an ART-like (adaptive resonance theory) mechanism. The ART is characterized as one of unsupervised learning neural network models, facilitated to solve stability-plasticity dilemma. The chip is a self organizing architecture that consists of three main structures including similarity, learning, and pruning. Dynamically adjusting the size of sensitivity regions of each neuron and adaptively pruning one of the neurons when an input pattern activates more than one neuron, the chip can preserve hardware resources (available neurons) to accommodate more categories. The clustered result by the implemented chip is then sent to an AHC (adaptive heuristic critic) architecture (emulated by a personal computer) to learn to balance an inverted pendulum system, which is also emulated by the personal computer for verifying the implemented architecture.
Keywords :
field programmable gate arrays; learning (artificial intelligence); microcomputers; neural net architecture; pendulums; self-adjusting systems; adaptive heuristic critic architecture; adaptive resonance theory; flexible adaptable-size topology; hardware implementation; inverted pendulum system; reinforcement learning algorithm; self organizing architecture; Computer architecture; Hardware; Microcomputers; Network topology; Neural networks; Neurons; Resonance; Stability; Subspace constraints; Unsupervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
VLSI Design and Video Technology, 2005. Proceedings of 2005 IEEE International Workshop on
Print_ISBN :
0-7803-9005-9
Type :
conf
DOI :
10.1109/IWVDVT.2005.1504643
Filename :
1504643
Link To Document :
بازگشت