DocumentCode
716564
Title
Inverse reinforcement learning of behavioral models for online-adapting navigation strategies
Author
Herman, Michael ; Fischer, Volker ; Gindele, Tobias ; Burgard, Wolfram
Author_Institution
Corp. Sector Res. & Adv. Eng., Robert Bosch GmbH, Stuttgart, Germany
fYear
2015
fDate
26-30 May 2015
Firstpage
3215
Lastpage
3222
Abstract
To increase the acceptance of autonomous systems in populated environments, it is indispensable to teach them social behavior. We would expect a social robot, which plans its motions among humans, to consider both the social acceptability of its behavior as well as task constraints, such as time limits. These requirements are often contradictory and therefore resulting in a trade-off. For example, a robot has to decide whether it is more important to quickly achieve its goal or to comply with social conventions, such as the proximity to humans, i.e., the robot has to react adaptively to task-specific priorities. In this paper, we present a method for priority-adaptive navigation of mobile autonomous systems, which optimizes the social acceptability of the behavior while meeting task constraints. We learn acceptability-dependent behavioral models from human demonstrations by using maximum entropy (MaxEnt) inverse reinforcement learning (IRL). These models are generative and describe the learned stochastic behavior. We choose the optimum behavioral model by maximizing the social acceptability under constraints on expected time-limits and reliabilities. This approach is evaluated in the context of driving behaviors based on the highway scenario of Levine et al. [1].
Keywords
learning (artificial intelligence); mobile robots; path planning; MaxEnt inverse reinforcement learning; acceptability-dependent behavioral models; behavioral models; driving behaviors context; maximum entropy; mobile autonomous systems; online-adapting navigation strategy; priority-adaptive navigation; social behavior; social robot; task constraints; Adaptation models; Computational modeling; Navigation; Optimization; Robots; Stochastic processes; Trajectory;
fLanguage
English
Publisher
ieee
Conference_Titel
Robotics and Automation (ICRA), 2015 IEEE International Conference on
Conference_Location
Seattle, WA
Type
conf
DOI
10.1109/ICRA.2015.7139642
Filename
7139642
Link To Document