DocumentCode :
730780
Title :
Estimating confidence scores on ASR results using recurrent neural networks
Author :
Kalgaonkar, Kaustubh ; Chaojun Liu ; Yifan Gong ; Kaisheng Yao
Author_Institution :
Microsoft Corp., Redmond, WA, USA
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
4999
Lastpage :
5003
Abstract :
In this paper we present a confidence estimation system using recurrent neural networks (RNN) and compare it to a traditional multilayered perception (MLP) based system. The ability of RNN to capture sequence information and improve decisions using processed history was main motivation to explore RNN´s for confidence estimation. In this paper we also explore two subtle variations of confidence estimator: one that uses objective extracted over the entire sequence for training, and other that uses dynamic programming to decode and estimate confidence on all the words of the sequence jointly. In our experiments, we observed that for a constant false positive (FP) rate of 3% we can secure a relative reduction of 10% in false negative (FN) rate when we replaced a MLP in confidence estimator with a RNN.We also observed that relative gains achieved by a RNN based confidence estimator are directly proportional to the number of word in the utterances.
Keywords :
multilayer perceptrons; recurrent neural nets; speech recognition; ASR results; confidence estimation system; constant false positive; dynamic programming; recurrent neural networks; sequence information; traditional multilayered perception; Acoustics; Estimation; Recurrent neural networks; Speech; Speech processing; Speech recognition; Training; Confidence Measures; Recurrent Neural Network; Word Identity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178922
Filename :
7178922
Link To Document :
بازگشت