Title :
Large vocabulary decoding and confidence estimation using word posterior probabilities
Author :
Evermann, G. ; Woodland, P.C.
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Abstract :
The paper investigates the estimation of word posterior probabilities based on word lattices and presents applications of these posteriors in a large vocabulary speech recognition system. A novel approach to integrating these word posterior probability distributions into a conventional Viterbi decoder is presented. The problem of the robust estimation of confidence scores from word posteriors is examined and a method based on decision trees is suggested. The effectiveness of these techniques is demonstrated on the broadcast news and the conversational telephone speech corpora where improvements both in terms of word error rate and normalised cross entropy were achieved compared to the baseline HTK evaluation systems
Keywords :
Viterbi decoding; decision trees; estimation theory; probability; speech recognition; vocabulary; Viterbi decoder; broadcast news; decision trees; large vocabulary decoding; large vocabulary speech recognition system; normalised cross entropy; robust confidence score estimation; telephone speech corpora; word error rate; word lattices; word posterior probabilities; Broadcasting; Decision trees; Decoding; Lattices; Probability distribution; Robustness; Speech recognition; Telephony; Viterbi algorithm; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
Print_ISBN :
0-7803-6293-4
DOI :
10.1109/ICASSP.2000.862067