Natural Language Generation as Incremental Planning Under Uncertainty: Adaptive Information Presentation for Statistical Dialogue Systems

Author

Rieser, Verena ; Lemon, Oliver ; Keizer, Simon

Author_Institution

Sch. of Math. & Comput. Sci. (MACS), Heriot-Watt Univ., Edinburgh, UK

Volume

22

Issue

5

fYear

2014

fDate

May-14

Firstpage

979

Lastpage

994

Abstract

We present and evaluate a novel approach to natural language generation (NLG) in statistical spoken dialogue systems (SDS) using a data-driven statistical optimization framework for incremental information presentation (IP), where there is a trade-off to be solved between presenting “enough" information to the user while keeping the utterances short and understandable. The trained IP model is adaptive to variation from the current generation context (e.g. a user and a non-deterministic sentence planner), and it incrementally adapts the IP policy at the turn level. Reinforcement learning is used to automatically optimize the IP policy with respect to a data-driven objective function. In a case study on presenting restaurant information, we show that an optimized IP strategy trained on Wizard-of-Oz data outperforms a baseline mimicking the wizard behavior in terms of total reward gained. The policy is then also tested with real users, and improves on a conventional hand-coded IP strategy used in a deployed SDS in terms of overall task success. The evaluation found that the trained IP strategy significantly improves dialogue task completion for real users, with up to a 8.2% increase in task success. This methodology also provides new insights into the nature of the IP problem, which has previously been treated as a module following dialogue management with no access to lower-level context features (e.g. from a surface realizer and/or speech synthesizer).

Keywords

learning (artificial intelligence); natural language processing; planning (artificial intelligence); IP policy; NLG; SDS; Wizard-of-Oz data; adaptive information presentation; data-driven objective function; data-driven statistical optimization framework; incremental planning; natural language generation; reinforcement learning; restaurant information; statistical spoken dialogue systems; trained IP model; Context; Databases; IP networks; Planning; Speech; Speech processing; Uncertainty; Information presentation; natural language generation; natural language user interfaces; reinforcement learning;

fLanguage

English

Journal_Title

Audio, Speech, and Language Processing, IEEE/ACM Transactions on

Publisher

ieee

ISSN

2329-9290

Type

jour

DOI

10.1109/TASL.2014.2315271

Filename

6782700