• DocumentCode
    62315
  • Title

    Natural Language Generation as Incremental Planning Under Uncertainty: Adaptive Information Presentation for Statistical Dialogue Systems

  • Author

    Rieser, Verena ; Lemon, Oliver ; Keizer, Simon

  • Author_Institution
    Sch. of Math. & Comput. Sci. (MACS), Heriot-Watt Univ., Edinburgh, UK
  • Volume
    22
  • Issue
    5
  • fYear
    2014
  • fDate
    May-14
  • Firstpage
    979
  • Lastpage
    994
  • Abstract
    We present and evaluate a novel approach to natural language generation (NLG) in statistical spoken dialogue systems (SDS) using a data-driven statistical optimization framework for incremental information presentation (IP), where there is a trade-off to be solved between presenting “enough" information to the user while keeping the utterances short and understandable. The trained IP model is adaptive to variation from the current generation context (e.g. a user and a non-deterministic sentence planner), and it incrementally adapts the IP policy at the turn level. Reinforcement learning is used to automatically optimize the IP policy with respect to a data-driven objective function. In a case study on presenting restaurant information, we show that an optimized IP strategy trained on Wizard-of-Oz data outperforms a baseline mimicking the wizard behavior in terms of total reward gained. The policy is then also tested with real users, and improves on a conventional hand-coded IP strategy used in a deployed SDS in terms of overall task success. The evaluation found that the trained IP strategy significantly improves dialogue task completion for real users, with up to a 8.2% increase in task success. This methodology also provides new insights into the nature of the IP problem, which has previously been treated as a module following dialogue management with no access to lower-level context features (e.g. from a surface realizer and/or speech synthesizer).
  • Keywords
    learning (artificial intelligence); natural language processing; planning (artificial intelligence); IP policy; NLG; SDS; Wizard-of-Oz data; adaptive information presentation; data-driven objective function; data-driven statistical optimization framework; incremental planning; natural language generation; reinforcement learning; restaurant information; statistical spoken dialogue systems; trained IP model; Context; Databases; IP networks; Planning; Speech; Speech processing; Uncertainty; Information presentation; natural language generation; natural language user interfaces; reinforcement learning;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASL.2014.2315271
  • Filename
    6782700