DocumentCode :
3422432
Title :
Improved modulation spectrum normalization techniques for robust speech recognition
Author :
Pan, Chi-an ; Wang, Chieh-cheng ; Hung, Jeih-weih
Author_Institution :
Dept of Electr. Eng., Nat. Chi Nan Univ., Nantou
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
4089
Lastpage :
4092
Abstract :
The modulation spectra of speech features are often distorted due to environmental interferences. In order to reduce this distortion, in this paper we propose several approaches to normalize the power spectral density (PSD) of the feature stream to a reference function. These approaches include least-squares temporal filtering (LSTF), least-squares spectrum fitting (LSSF) and magnitude spectrum interpolation (MSI). It is shown that all the proposed approaches can effectively improve the speech recognition accuracy in various noise corrupted environments. In experiments conducted on the Aurora-2 noisy digits database with a complex back-end, these new approaches provide an average relative error reduction rate of over 40% when compared with the baseline MFCC processing.
Keywords :
filtering theory; interpolation; least mean squares methods; modulation; spectral analysis; speech recognition; Improved modulation spectrum normalization technique; least-squares spectrum fitting; least-squares temporal filtering; magnitude spectrum interpolation; power spectral density; reference function; robust speech recognition; Acoustic distortion; Cepstral analysis; Filtering; Interpolation; Mel frequency cepstral coefficient; Nonlinear filters; Robustness; Spatial databases; Speech recognition; Working environment noise; feature normalization; modulation spectrum; robust speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518553
Filename :
4518553
Link To Document :
بازگشت