Title :
Learning from failed demonstrations in unreliable systems
Author :
Rai, Akshara ; de Chambrier, Guillaume ; Billard, Aude
Abstract :
This paper presents a method to teach a robot to play Ping Pong from failed demonstrations in a highly noisy and uncertain setting. To infer useful information from failed demonstrations, we use a MultiDonut Algorithm (Grollman and Billard, 2012) that minimises the probability of repeating a failed demonstration and generates new attempts similar but not quite the same as the demonstration. We compare human demonstrations against a random strategy and show that human demonstrations provide useful information and hence yield faster learning, especially in higher dimensions. We show that learning from observing failed attempts allows the robot to perform the task more reliably than any individual demonstrator did. We also show how this algorithm adapts to gradual deterioration in the system and increases the chances of success when interacting with an unreliable system.
Keywords :
automatic programming; humanoid robots; learning (artificial intelligence); probability; robot programming; MultiDonut algorithm; Ping Pong; human demonstrations; learning from failed demonstrations; probability minimisation; robot; unreliable systems; Games; Humanoid robots; Noise measurement; Training; Training data;
Conference_Titel :
Humanoid Robots (Humanoids), 2013 13th IEEE-RAS International Conference on
Print_ISBN :
978-1-4799-2617-6
DOI :
10.1109/HUMANOIDS.2013.7030007