Title :
A motor learning model based on the basal ganglia in operant conditioning
Author :
Yuanyuan Gao ; Hongjun Song
Author_Institution :
Coll. of Inf. Eng., Zhejiang A&F Univ., Hangzhou, China
fDate :
May 31 2014-June 2 2014
Abstract :
In this paper, a motor learning model based on the basal ganglia (BG) in operant conditioning (OC) using actor-critic (AC) learning is presented. The model has three networks for realizing action selecting function of the BS, such as actor network, critic network and explorer network. Actor network uses a probabilistic fuzzy controller for action selection which is enhanced by the introduction of a probability measure into the learning structure based on OC learning. A tropism mechanism is designed for describing intrinsic motivation which is a key factor for animal learning and it can direct the orientation of the agent learning. Critic network is composed of a multi-layer feedforward network and the learning is enhanced by TD(λ) algorithm and gradient descent algorithm. Explorer network is to settle the conflict between exploration and exploitation. Through the experiments of cognitive experiment, the method endows the mobile robot with the capabilities of learning obstacle avoidance and finding the target actively.
Keywords :
collision avoidance; fuzzy control; gradient methods; learning systems; mobile robots; motion control; multilayer perceptrons; neurocontrollers; probability; OC learning; action selecting function; action selection; active target finding; actor network; actor-critic learning; agent learning orientation; animal learning; basal ganglia; cognitive experiment; critic network; exploitation; explorer network; gradient descent algorithm; intrinsic motivation; learning enhancement; learning structure; mobile robot; motor learning model; multilayer feedforward network; obstacle avoidance learning; operant conditioning; probabilistic fuzzy controller; probability measure; tropism mechanism; Equations; Mice; Neurons; Robot sensing systems; actor-critic; basal ganglia; motor learning; operant conditioning; reinforcement learning;
Conference_Titel :
Control and Decision Conference (2014 CCDC), The 26th Chinese
Conference_Location :
Changsha
Print_ISBN :
978-1-4799-3707-3
DOI :
10.1109/CCDC.2014.6853115