DocumentCode :
2980538
Title :
Adaptation of HMMS in the presence of additive and convolutional noise
Author :
Hirsch, Hans-Günter
Author_Institution :
Ericsson Eurolab Deutschland GmbH, Nuremberg, Germany
fYear :
1997
fDate :
14-17 Dec 1997
Firstpage :
412
Lastpage :
419
Abstract :
The performance of speech recognizers deteriorates in the case of a mismatch between the conditions during training and recognition. One difference is the presence of a stationary background noise during recognition which is also referred to as additive noise. Furthermore the recognition is influenced by the frequency response of the whole transmission channel from the speaker to the audio input of the recognizer. The term convolutional noise has been introduced for this type of distortion. Several approaches are known to compensate these effects individually or both together (Gales and Young, 1996). This paper describes an approach which compensates both types of noise. The scheme is based on an estimation of the noise spectrum (Hirsch and Ehrlicher, 1995). Furthermore the frequency response is iteratively estimated by using the alignment information of the best path in the Viterbi algorithm. The comparison between the spectra of the input signal and the spectra of the corresponding HMM (hidden Markov model) states is taken as basis for the filter estimation. The estimated additive and convolutional noise components are used as input to the well known Parallel Model Combination (PMC) approach (Gales, 1995) to adapt the whole word HMMs of a speaker independent connected word recognizer. Considerable improvements can be achieved in the presence of just one type of noise as well as in the presence of both types together
Keywords :
convolution; hidden Markov models; noise; performance evaluation; spectral analysis; speech recognition; Parallel Model Combination method; Viterbi algorithm; additive noise; audio input; convolutional noise; frequency response; hidden Markov model; noise compensation; noise spectrum estimation; performance; speaker independent recognizer; spectra; speech recognition; stationary background noise; training; transmission channel; Additive noise; Background noise; Convolution; Filters; Frequency estimation; Frequency response; Hidden Markov models; Speech recognition; State estimation; Viterbi algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-7803-3698-4
Type :
conf
DOI :
10.1109/ASRU.1997.659118
Filename :
659118
Link To Document :
بازگشت