Title :
Confidence measures for large vocabulary continuous speech recognition
Author :
Wessel, Frank ; Schlüter, Ralf ; Macherey, Klaus ; Ney, Hermann
Author_Institution :
Lehrstuhl fur Inf. VI, Tech. Hochschule Aachen, Germany
fDate :
3/1/2001 12:00:00 AM
Abstract :
In this paper, we present several confidence measures for large vocabulary continuous speech recognition. We propose to estimate the confidence of a hypothesized word directly as its posterior probability, given all acoustic observations of the utterance. These probabilities are computed on word graphs using a forward-backward algorithm. We also study the estimation of posterior probabilities on N-best lists instead of word graphs and compare both algorithms in detail. In addition, we compare the posterior probabilities with two alternative confidence measures, i.e., the acoustic stability and the hypothesis density. We present experimental results on five different corpora: the Dutch ARISE 1k evaluation corpus, the German Verbmobil ´98 7k evaluation corpus, the English North American Business ´94 20k and 64k development corpora, and the English Broadcast News ´96 65k evaluation corpus. We show that the posterior probabilities computed on word graphs outperform all other confidence measures. The relative reduction in confidence error rate ranges between 19% and 35% compared to the baseline confidence error rate
Keywords :
error statistics; graph theory; speech recognition; Dutch ARISE 1k evaluation corpus; English Broadcast News ´96 65k evaluation corpus; English North American Business ´94 20k corpus; English North American Business ´94 64k corpus; German Verbmobil ´98 7k evaluation corpus; N-best lists; acoustic stability; confidence measures; error rate; forward-backward algorithm; hypothesis density; hypothesized word; large vocabulary continuous speech recognition; posterior probability; utterance; word graphs; Acoustic measurements; Broadcasting; Density measurement; Error analysis; Error correction; Position measurement; Probability; Speech recognition; Stability; Vocabulary;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on