Title :
Learning from human reward benefits from socio-competitive feedback
Author :
Guangliang Li ; Hung, Hayley ; Whiteson, Shimon ; Knox, W. Bradley
Author_Institution :
Univ. of Amsterdam, Amsterdam, Netherlands
Abstract :
Learning from rewards generated by a human trainer observing an agent in action has proven to be a powerful method for non-experts in autonomous agents to teach such agents to perform challenging tasks. Since the efficacy of this approach depends critically on the reward the trainer provides, we consider how the interaction between the trainer and the agent should be designed so as to increase the efficiency of the training process. This paper investigates the influence of the agent´s socio-competitive feedback on the human trainer´s training behavior and the agent´s learning. The results of our user study with 85 subjects suggest that the agent´s socio-competitive feedback substantially increases the engagement of the participants in the game task and improves the agents´ performance, even though the participants do not directly play the game but instead train the agent to do so. Moreover, making this feedback active further induces more subjects to train the agents longer but does not further improve agent performance. Our analysis suggests that this may be because some trainers train a more complex behavior in the agent that is appropriate for a different performance metric that is sometimes associated with the target task.
Keywords :
learning (artificial intelligence); multi-agent systems; agent behavior; agent learning; agent performance; agent training; human reward learning; socio-competitive feedback; Bidirectional control; Educational institutions; Facebook; Games; Lead; Training;
Conference_Titel :
Development and Learning and Epigenetic Robotics (ICDL-Epirob), 2014 Joint IEEE International Conferences on
Conference_Location :
Genoa
DOI :
10.1109/DEVLRN.2014.6982960