DocumentCode :
3600085
Title :
Learning from failed demonstrations in unreliable systems
Author :
Rai, Akshara ; de Chambrier, Guillaume ; Billard, Aude
fYear :
2013
Firstpage :
410
Lastpage :
416
Abstract :
This paper presents a method to teach a robot to play Ping Pong from failed demonstrations in a highly noisy and uncertain setting. To infer useful information from failed demonstrations, we use a MultiDonut Algorithm (Grollman and Billard, 2012) that minimises the probability of repeating a failed demonstration and generates new attempts similar but not quite the same as the demonstration. We compare human demonstrations against a random strategy and show that human demonstrations provide useful information and hence yield faster learning, especially in higher dimensions. We show that learning from observing failed attempts allows the robot to perform the task more reliably than any individual demonstrator did. We also show how this algorithm adapts to gradual deterioration in the system and increases the chances of success when interacting with an unreliable system.
Keywords :
automatic programming; humanoid robots; learning (artificial intelligence); probability; robot programming; MultiDonut algorithm; Ping Pong; human demonstrations; learning from failed demonstrations; probability minimisation; robot; unreliable systems; Games; Humanoid robots; Noise measurement; Training; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Humanoid Robots (Humanoids), 2013 13th IEEE-RAS International Conference on
ISSN :
2164-0572
Print_ISBN :
978-1-4799-2617-6
Type :
conf
DOI :
10.1109/HUMANOIDS.2013.7030007
Filename :
7030007
Link To Document :
بازگشت