DocumentCode :
2182454
Title :
COACH: Learning continuous actions from COrrective Advice Communicated by Humans
Author :
Celemin, Carlos ; Ruiz-del-Solar, Javier
Author_Institution :
University of Chile, Advanced Mining Technology Center &Dept. of Elect. Eng., Santiago, Chile
fYear :
2015
fDate :
27-31 July 2015
Firstpage :
581
Lastpage :
586
Abstract :
COACH (COrrective Advice Communicated by Humans), a new interactive learning framework that allows non-expert humans to shape a policy through corrective advice, using a binary signal in the action domain of the agent, is proposed. One of the main innovative features of COACH is a mechanism for adaptively adjusting the amount of human feedback that a given action receives, taking into consideration past feedback. The performance of COACH is compared with the one of TAMER (Teaching an Agent Manually via Evaluative Reinforcement), ACTAMER (Actor-Critic TAMER), and an autonomous agent trained using SARSA(?) in two reinforcement learning problems. COACH outperforms all other learning frameworks in the reported experiments. In addition, results show that COACH is able to transfer successfully human knowledge to agents with continuous actions, being a complementary approach to TAMER, which is appropriate for teaching in discrete action domains.
Keywords :
Adaptation models; Computational modeling; Decision making; Legged locomotion; Training; Robot learning; human feedback in action domains; human teachers; interactive learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Robotics (ICAR), 2015 International Conference on
Conference_Location :
Istanbul, Turkey
Type :
conf
DOI :
10.1109/ICAR.2015.7251514
Filename :
7251514
Link To Document :
بازگشت