Efficient model learning for dialog management

Author

Doshi, Finale ; Roy, Nicholas

Author_Institution

CSAIL MIT, Cambridge, MA, USA

fYear

2007

fDate

9-11 March 2007

Firstpage

65

Lastpage

72

Abstract

Intelligent planning algorithms such as the Partially Observable Markov Decision Process (POMDP) have succeeded in dialog management applications [10, 11, 12] because they are robust to the inherent uncertainty of human interaction. Like all dialog planning systems, however, POMDPs require an accurate model of the user (e.g., what the user might say or want). POMDPs are generally specified using a large probabilistic model with many parameters. These parameters are difficult to specify from domain knowledge, and gathering enough data to estimate the parameters accurately a priori is expensive. In this paper, we take a Bayesian approach to learning the user model simultaneously with dialog manager policy. At the heart of our approach is an efficient incremental update algorithm that allows the dialog manager to replan just long enough to improve the current dialog policy given data from recent interactions. The update process has a relatively small computational cost, preventing long delays in the interaction. We are able to demonstrate a robust dialog manager that learns from interaction data, out-performing a hand-coded model in simulation and in a robotic wheelchair application.

Keywords

Bayes methods; Markov processes; human-robot interaction; interactive systems; learning (artificial intelligence); wheelchairs; Bayesian approach; POMDP; dialog management applications; dialog manager policy; human interaction uncertainty; incremental update algorithm; intelligent planning algorithms; model learning; partially observable Markov decision process; probabilistic model; robotic wheelchair application; robust dialog manager; Abstracts; Convergence; Face; History; Planning; Pragmatics; Robots; Human-robot interaction; decision-making under uncertainty; model learning;

fLanguage

English

Publisher

ieee

Conference_Titel

Human-Robot Interaction (HRI), 2007 2nd ACM/IEEE International Conference on

Conference_Location

Arlington, VA

ISSN

2167-2121

Print_ISBN

978-1-59593-617-2

Type

conf

Filename

6251718