Title :
Joint n-best rescoring for repeated utterances in spoken dialog systems
Author :
Bohus, Dan ; Zweig, Geoffrey ; Nguyen, Patrick ; Li, Xiao
Author_Institution :
Microsoft Res., Redmond, WA
Abstract :
Due to speech recognition errors, repetitions are a frequent phenomenon in spoken dialog systems. In previous work (G. Zweig et al., 2008) we have proposed a joint decoding model that can leverage structural relationships between repeated utterances for improving recognition performance. In this paper we extend this work in two directions. First, we propose a direct, classification-based model for the same task. The new model can leverage features that were fundamentally hard to capture in the previous framework (e.g. spellings, false-starts, etc.) and leads to an additional performance improvement. Second, we show how both models can be used to perform a combined rescoring of two n-best lists that are part of a repetition pair.
Keywords :
decoding; interactive systems; pattern classification; speech coding; speech recognition; classification-based model; joint decoding model; joint n-best rescoring; repeated utterances; speech recognition errors; spoken dialog systems; Automatic speech recognition; Bayesian methods; Data analysis; Decoding; Natural languages; Pattern matching; Performance gain; Signal detection; Speech recognition; repetitions; rescoring; speech recognition;
Conference_Titel :
Spoken Language Technology Workshop, 2008. SLT 2008. IEEE
Conference_Location :
Goa
Print_ISBN :
978-1-4244-3471-8
Electronic_ISBN :
978-1-4244-3472-5
DOI :
10.1109/SLT.2008.4777858