DocumentCode :
2466498
Title :
A non-linear model transformation for ML stochastic matching in additive noise
Author :
Wong, Shuen Kong ; Shi, Bertram
Author_Institution :
Dept. of Electr. & Electron. Eng., Hong Kong Univ. of Sci. & Technol., Kowloon, Hong Kong
fYear :
1998
fDate :
7-9 Dec 1998
Firstpage :
143
Lastpage :
148
Abstract :
We present a non-linear model transformation for adapting Gaussian mixture HMMs using both static and dynamic MFCC observation vectors to the presence of additive noise. This transformation depends upon a few compensation coefficients which can be estimated from a short training token of noise. Alternatively, one can also apply maximum-likelihood stochastic matching to estimate the compensation coefficients from speech embedded in noise. This can eliminate the need for segmentation of pure noise from speech for the estimation and can also compensate for inaccuracies in the estimation of the compensation coefficients as well as those due to the approximations used in deriving the transformation
Keywords :
Gaussian processes; cepstral analysis; hidden Markov models; maximum likelihood estimation; noise; speech recognition; Gaussian mixture HMM; ML stochastic matching; additive noise; approximations; cepstral coefficients; compensation coefficients; dynamic MFCC observation vectors; maximum-likelihood stochastic matching; nonlinear model transformation; short training noise token; speech; speech recognition; static MFCC observation vectors; Additive noise; Filter bank; Gaussian noise; Hidden Markov models; Maximum likelihood estimation; Mel frequency cepstral coefficient; Noise shaping; Speech enhancement; Stochastic resonance; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia Signal Processing, 1998 IEEE Second Workshop on
Conference_Location :
Redondo Beach, CA
Print_ISBN :
0-7803-4919-9
Type :
conf
DOI :
10.1109/MMSP.1998.738926
Filename :
738926
Link To Document :
بازگشت