Title :
Natural Language Generation as Incremental Planning Under Uncertainty: Adaptive Information Presentation for Statistical Dialogue Systems
Author :
Rieser, Verena ; Lemon, Oliver ; Keizer, Simon
Author_Institution :
Sch. of Math. & Comput. Sci. (MACS), Heriot-Watt Univ., Edinburgh, UK
Abstract :
We present and evaluate a novel approach to natural language generation (NLG) in statistical spoken dialogue systems (SDS) using a data-driven statistical optimization framework for incremental information presentation (IP), where there is a trade-off to be solved between presenting “enough" information to the user while keeping the utterances short and understandable. The trained IP model is adaptive to variation from the current generation context (e.g. a user and a non-deterministic sentence planner), and it incrementally adapts the IP policy at the turn level. Reinforcement learning is used to automatically optimize the IP policy with respect to a data-driven objective function. In a case study on presenting restaurant information, we show that an optimized IP strategy trained on Wizard-of-Oz data outperforms a baseline mimicking the wizard behavior in terms of total reward gained. The policy is then also tested with real users, and improves on a conventional hand-coded IP strategy used in a deployed SDS in terms of overall task success. The evaluation found that the trained IP strategy significantly improves dialogue task completion for real users, with up to a 8.2% increase in task success. This methodology also provides new insights into the nature of the IP problem, which has previously been treated as a module following dialogue management with no access to lower-level context features (e.g. from a surface realizer and/or speech synthesizer).
Keywords :
learning (artificial intelligence); natural language processing; planning (artificial intelligence); IP policy; NLG; SDS; Wizard-of-Oz data; adaptive information presentation; data-driven objective function; data-driven statistical optimization framework; incremental planning; natural language generation; reinforcement learning; restaurant information; statistical spoken dialogue systems; trained IP model; Context; Databases; IP networks; Planning; Speech; Speech processing; Uncertainty; Information presentation; natural language generation; natural language user interfaces; reinforcement learning;
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
DOI :
10.1109/TASL.2014.2315271