Title :
Efficient integration of multiple pronunciations in a large vocabulary decoder
Author :
Schramm, Hauke ; Aubert, Xavier L.
Author_Institution :
Philips Res. Lab., Aachen, Germany
Abstract :
The paper describes the improved handling of multiple pronunciations achieved in the Philips research decoder by (1) incorporating some prior information about their distributions and (2) combining the acoustic contributions of concurrent alternate word hypotheses. Starting from a baseline system where multiple pronunciations are treated as word copies without priors, an extension of the usual Viterbi decoding is presented which integrates unigram priors in a weighted sum of acoustic probabilities. Several approximations are discussed leading to new decoding aspects. Experimental results are presented for US broadcast news recordings. It is shown that the use of unigram priors has a clear positive impact on both error rate and decoding cost while the sum over multiple pronunciation contributions brings another small improvement. An overall 4% reduction of the error rate is achieved on the HUB-4 evaluation sets of 97 and 98
Keywords :
Viterbi decoding; probability; speech recognition; vocabulary; Philips research decoder; US broadcast news recordings; Viterbi decoding; acoustic contributions; acoustic probabilities; alternate word hypotheses; decoding cost; error rate; large vocabulary decoder; multiple pronunciations; prior distribution information; unigram priors; weighted sum; word copies; Broadcasting; Costs; Decoding; Error analysis; Laboratories; Lips; Probability distribution; Speech recognition; Viterbi algorithm; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
Print_ISBN :
0-7803-6293-4
DOI :
10.1109/ICASSP.2000.862068