Title :
Imitation and Reinforcement Learning
Author :
Kober, Jens ; Peters, Jan
Author_Institution :
Dept. of Empirical Inference & Machine Learning, Max Planck Inst. for Biol. Cybern., Tübingen, Germany
fDate :
6/1/2010 12:00:00 AM
Abstract :
In this article, we present both novel learning algorithms and experiments using the dynamical system MPs. As such, we describe this MP representation in a way that it is straightforward to reproduce. We review an appropriate imitation learning method, i.e., locally weighted regression, and show how this method can be used both for initializing RL tasks as well as for modifying the start-up phase in a rhythmic task. We also show our current best-suited RL algorithm for this framework, i.e., PoWER. We present two complex motor tasks, i.e., ball-in-a-cup and ball paddling, learned on a real, physical Barrett WAM, using the methods presented in this article. Of particular interest is the ball-paddling application, as it requires a combination of both rhythmic and discrete dynamical systems MPs during the start-up phase to achieve a particular task.
Keywords :
discrete systems; learning (artificial intelligence); regression analysis; robots; ball paddling; ball-in-a-cup; discrete dynamical system; imitation learning; industrial robots; motor primitive; reinforcement learning; weighted regression; whole arm manipulator; Anthropomorphism; Humans; Intelligent robots; Learning systems; Legged locomotion; Manufacturing; Planar motors; Robot programming; Service robots; Stability;
Journal_Title :
Robotics Automation Magazine, IEEE
Conference_Location :
6/1/2010 12:00:00 AM
DOI :
10.1109/MRA.2010.936952