Title :
The use of discriminative belief tracking in POMDP-based dialogue systems
Author :
Dongho Kim ; Henderson, Matthew ; Gasic, Milica ; Tsiakoulis, Pirros ; Young, Steve
Author_Institution :
Dept. of Eng., Univ. of Cambridge, Cambridge, UK
Abstract :
Statistical spoken dialogue systems based on Partially Observable Markov Decision Processes (POMDPs) have been shown to be more robust to speech recognition errors by maintaining a belief distribution over multiple dialogue states and making policy decisions based on the entire distribution rather than the single most likely hypothesis. To date most POMDP-based systems have used generative trackers. However, concerns about modelling accuracy have created interest in discriminative methods, and recent results from the second Dialog State Tracking Challenge (DSTC2) have shown that discriminative trackers can significantly outperform generative models in terms of tracking accuracy. The aim of this paper is to investigate the extent to which these improvements translate into improved task completion rates when incorporated into a spoken dialogue system. To do this, the Recurrent Neural Network (RNN) tracker described by Henderson et al in DSTC2 was integrated into the Cambridge statistical dialogue system and compared with the existing generative Bayesian network tracker. Using a Gaussian Process (GP) based policy, the experimental results indicate that the system using the RNN tracker performs significantly better than the system with the original Bayesian network tracker.
Keywords :
Gaussian processes; Markov processes; interactive systems; recurrent neural nets; speech recognition; statistical analysis; Cambridge statistical dialogue system; DSTC2; GP based policy; Gaussian process based policy; POMDP-based dialogue systems; RNN tracker; belief distribution; dialog state tracking challenge; discriminative belief tracking; multiple dialogue states; partially observable Markov decision processes; policy decision making; recurrent neural network tracker; statistical spoken dialogue systems; Accuracy; Error analysis; Gaussian processes; Recurrent neural networks; Speech recognition; Training; Vectors; POMDP; belief tracking; dialogue management; recurrent neural networks; spoken dialogue systems;
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2014 IEEE
DOI :
10.1109/SLT.2014.7078600