Title :
Towards improving ASR robustness for PSN & GSM telephone applications
Author :
Mokbel, C. ; Mauuary, L. ; Jouvet, D. ; Monné, J. ; Sorin, C. ; Simonin, J. ; Bartkova, K.
Author_Institution :
CNET, Lannion, France
fDate :
30 Sep-1 Oct 1996
Abstract :
In real-life applications, the speech recognition system errors are mainly due to inadequate detection of speech segments, unreliable rejection of out-of-vocabulary (OOV) words, and noise and transmission channel effects. In this paper, we present the results of several experiments carried out on field vs. laboratory databases and on databases collected over PSN and GSM networks. The main sources of errors are analyzed. Preprocessing techniques as well as HMM adaptation techniques are used to increase the robustness to mismatches between training and testing conditions. We show that a blind equalization scheme improves significantly the recognition accuracy on both field and GSM data. Bayesian adaptation of hidden Markov models (HMM) parameters produces robust models to field conditions. The obtained results prove that HMM adaptation and preprocessing techniques can be advantageously combined, in order to improve ASR robustness
Keywords :
Bayes methods; adaptive filters; hidden Markov models; noise; speech recognition; telephony; ASR robustness; Bayesian adaptation; GSM network; HMM adaptation techniques; PSN network; blind equalization scheme; hidden Markov models; out-of-vocabulary words; recognition accuracy; speech recognition; speech segments; transmission channel effects; Automatic speech recognition; Databases; Error analysis; GSM; Hidden Markov models; Laboratories; Noise robustness; Speech enhancement; Speech recognition; Telephony;
Conference_Titel :
Interactive Voice Technology for Telecommunications Applications, 1996. Proceedings., Third IEEE Workshop on
Conference_Location :
Basking Ridge, NJ
Print_ISBN :
0-7803-3238-5
DOI :
10.1109/IVTTA.1996.552763