DocumentCode :
980438
Title :
Combining expert neural networks using reinforcement feedback for learning primitive grasping behavior
Author :
Moussa, Medhat A.
Author_Institution :
Sch. of Eng., Univ. of Guelph, Ont., Canada
Volume :
15
Issue :
3
fYear :
2004
fDate :
5/1/2004 12:00:00 AM
Firstpage :
629
Lastpage :
638
Abstract :
This paper present an architecture for combining a mixture of experts. The architecture has two unique features: 1) it assumes no prior knowledge of the size or structure of the mixture and allows the number of experts to dynamically expand during training, and 2) reinforcement feedback is used to guide the combining/expansion operation. The architecture is particularly suitable for applications when there is a need to approximate a many-to-many mapping. An example of such a problem is the task of training a robot to grasp arbitrarily shaped objects. This task requires the approximation of a many-to-many mapping, since various configurations can be used to grasp an object, and several objects can share the same grasping configuration. Experiments in a simulated environment using a 28-object database showed how the algorithm dynamically combined and expanded a mixture of neural networks to achieve the learning task. The paper also presents a comparison with two other nonlearning approaches.
Keywords :
feedback; grippers; learning (artificial intelligence); neural nets; robot programming; arbitrary shaped objects; expert mixtures; expert neural networks; many-to-many mapping; modular architecture; nonlearning approaches; object database; primitive grasping behavior learning; reinforcement feedback; robot grasping configuration; robot learning; robot training; Councils; Databases; Encoding; Function approximation; Heuristic algorithms; Neural networks; Neurofeedback; Robot kinematics; Robot sensing systems; Feedback; Hand Strength; Learning; Neural Networks (Computer); Reinforcement (Psychology); Robotics;
fLanguage :
English
Journal_Title :
Neural Networks, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9227
Type :
jour
DOI :
10.1109/TNN.2004.824412
Filename :
1296690
Link To Document :
بازگشت