DocumentCode
2466498
Title
A non-linear model transformation for ML stochastic matching in additive noise
Author
Wong, Shuen Kong ; Shi, Bertram
Author_Institution
Dept. of Electr. & Electron. Eng., Hong Kong Univ. of Sci. & Technol., Kowloon, Hong Kong
fYear
1998
fDate
7-9 Dec 1998
Firstpage
143
Lastpage
148
Abstract
We present a non-linear model transformation for adapting Gaussian mixture HMMs using both static and dynamic MFCC observation vectors to the presence of additive noise. This transformation depends upon a few compensation coefficients which can be estimated from a short training token of noise. Alternatively, one can also apply maximum-likelihood stochastic matching to estimate the compensation coefficients from speech embedded in noise. This can eliminate the need for segmentation of pure noise from speech for the estimation and can also compensate for inaccuracies in the estimation of the compensation coefficients as well as those due to the approximations used in deriving the transformation
Keywords
Gaussian processes; cepstral analysis; hidden Markov models; maximum likelihood estimation; noise; speech recognition; Gaussian mixture HMM; ML stochastic matching; additive noise; approximations; cepstral coefficients; compensation coefficients; dynamic MFCC observation vectors; maximum-likelihood stochastic matching; nonlinear model transformation; short training noise token; speech; speech recognition; static MFCC observation vectors; Additive noise; Filter bank; Gaussian noise; Hidden Markov models; Maximum likelihood estimation; Mel frequency cepstral coefficient; Noise shaping; Speech enhancement; Stochastic resonance; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Signal Processing, 1998 IEEE Second Workshop on
Conference_Location
Redondo Beach, CA
Print_ISBN
0-7803-4919-9
Type
conf
DOI
10.1109/MMSP.1998.738926
Filename
738926
Link To Document