DocumentCode :
3449575
Title :
Cooperative coaching in robot learning
Author :
Schneider, Jeff G. ; Brown, Christopher M.
Author_Institution :
Robotics Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
3
fYear :
1995
fDate :
5-9 Aug 1995
Firstpage :
332
Abstract :
Many closed loop learning algorithms perform gradient descent on a cost function with respect to the parameters of a learning controller. The authors observe that both local closed loop learners, which consider only the cost of the current time step, and optimal control based closed loop learners, which consider the future effects of control actions, can become stuck in sub-optimal local minima in the cost function. The authors propose the use of “cooperating coaches” to deal with this problem. Each coach attempts gradient descent based on its own cost function and they work together to avoid getting stuck in local minima. When one coach has achieved the best result it can (the gradient for its cost function is zero), another coach takes over to guide the search through the parameter space. The authors demonstrate cooperative coaching on the problem of curve tracking with an inverted pendulum and show that it yields faster, smoother tracking of target curves by combining the best aspects of two different coaches
Keywords :
closed loop systems; cooperative systems; learning (artificial intelligence); learning systems; optimal control; robot programming; robots; cooperative coaching; cost function; curve tracking; gradient descent; inverted pendulum; local closed loop learners; optimal control based closed loop learners; robot learning; search; Control systems; Cost function; Error correction; Optimal control; Orbital robotics; Robot control; Target tracking; Three-term control; Tracking loops; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems 95. 'Human Robot Interaction and Cooperative Robots', Proceedings. 1995 IEEE/RSJ International Conference on
Conference_Location :
Pittsburgh, PA
Print_ISBN :
0-8186-7108-4
Type :
conf
DOI :
10.1109/IROS.1995.525905
Filename :
525905
Link To Document :
بازگشت