DocumentCode
730780
Title
Estimating confidence scores on ASR results using recurrent neural networks
Author
Kalgaonkar, Kaustubh ; Chaojun Liu ; Yifan Gong ; Kaisheng Yao
Author_Institution
Microsoft Corp., Redmond, WA, USA
fYear
2015
fDate
19-24 April 2015
Firstpage
4999
Lastpage
5003
Abstract
In this paper we present a confidence estimation system using recurrent neural networks (RNN) and compare it to a traditional multilayered perception (MLP) based system. The ability of RNN to capture sequence information and improve decisions using processed history was main motivation to explore RNN´s for confidence estimation. In this paper we also explore two subtle variations of confidence estimator: one that uses objective extracted over the entire sequence for training, and other that uses dynamic programming to decode and estimate confidence on all the words of the sequence jointly. In our experiments, we observed that for a constant false positive (FP) rate of 3% we can secure a relative reduction of 10% in false negative (FN) rate when we replaced a MLP in confidence estimator with a RNN.We also observed that relative gains achieved by a RNN based confidence estimator are directly proportional to the number of word in the utterances.
Keywords
multilayer perceptrons; recurrent neural nets; speech recognition; ASR results; confidence estimation system; constant false positive; dynamic programming; recurrent neural networks; sequence information; traditional multilayered perception; Acoustics; Estimation; Recurrent neural networks; Speech; Speech processing; Speech recognition; Training; Confidence Measures; Recurrent Neural Network; Word Identity;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178922
Filename
7178922
Link To Document