• DocumentCode
    565668
  • Title

    Efficient model learning for dialog management

  • Author

    Doshi, Finale ; Roy, Nicholas

  • Author_Institution
    CSAIL MIT, Cambridge, MA, USA
  • fYear
    2007
  • fDate
    9-11 March 2007
  • Firstpage
    65
  • Lastpage
    72
  • Abstract
    Intelligent planning algorithms such as the Partially Observable Markov Decision Process (POMDP) have succeeded in dialog management applications [10, 11, 12] because they are robust to the inherent uncertainty of human interaction. Like all dialog planning systems, however, POMDPs require an accurate model of the user (e.g., what the user might say or want). POMDPs are generally specified using a large probabilistic model with many parameters. These parameters are difficult to specify from domain knowledge, and gathering enough data to estimate the parameters accurately a priori is expensive. In this paper, we take a Bayesian approach to learning the user model simultaneously with dialog manager policy. At the heart of our approach is an efficient incremental update algorithm that allows the dialog manager to replan just long enough to improve the current dialog policy given data from recent interactions. The update process has a relatively small computational cost, preventing long delays in the interaction. We are able to demonstrate a robust dialog manager that learns from interaction data, out-performing a hand-coded model in simulation and in a robotic wheelchair application.
  • Keywords
    Bayes methods; Markov processes; human-robot interaction; interactive systems; learning (artificial intelligence); wheelchairs; Bayesian approach; POMDP; dialog management applications; dialog manager policy; human interaction uncertainty; incremental update algorithm; intelligent planning algorithms; model learning; partially observable Markov decision process; probabilistic model; robotic wheelchair application; robust dialog manager; Abstracts; Convergence; Face; History; Planning; Pragmatics; Robots; Human-robot interaction; decision-making under uncertainty; model learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Human-Robot Interaction (HRI), 2007 2nd ACM/IEEE International Conference on
  • Conference_Location
    Arlington, VA
  • ISSN
    2167-2121
  • Print_ISBN
    978-1-59593-617-2
  • Type

    conf

  • Filename
    6251718