DocumentCode
2839128
Title
Double Gaussian based feature normalization for robust speech recognition
Author
Liu, Bo ; Li-Rong Dai ; Li, Jin-Yu ; Wang, Ren-Hua
Author_Institution
Univ. of Sci. & Technol. of China, Anhui, China
fYear
2004
fDate
15-18 Dec. 2004
Firstpage
253
Lastpage
256
Abstract
In this paper, a new feature normalization approach, based on the cumulative density function (CDF) matching principle, is proposed. Since speech features in noisy environments usually follow bimodal distributions, we fully utilize this characteristic by representing the CDF of the features with a double Gaussian model. A feature normalization process is performed according to the estimated CDF. The experimental results on the Aurora2 database show that the performance of our method is much better than that of the conventional mean and variance normalization (MVN) method, and comparable to that of the method combining spectral subtraction and histogram equalization (HE). Moreover, further improvement has been gained by combining our method with a simple temporal feature smoothing process. This result suggests that our new method has the potential to be integrated with other techniques to provide even better performance.
Keywords
Gaussian distribution; higher order statistics; signal denoising; smoothing methods; speech recognition; CDF matching principle; bimodal distribution; cumulative density function; double Gaussian based feature normalization; noisy environment speech features; robust speech recognition; temporal feature smoothing process; Density functional theory; Helium; Histograms; Parametric statistics; Robustness; Spatial databases; Speech enhancement; Speech recognition; Testing; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing, 2004 International Symposium on
Print_ISBN
0-7803-8678-7
Type
conf
DOI
10.1109/CHINSL.2004.1409634
Filename
1409634
Link To Document