• DocumentCode
    2839128
  • Title

    Double Gaussian based feature normalization for robust speech recognition

  • Author

    Liu, Bo ; Li-Rong Dai ; Li, Jin-Yu ; Wang, Ren-Hua

  • Author_Institution
    Univ. of Sci. & Technol. of China, Anhui, China
  • fYear
    2004
  • fDate
    15-18 Dec. 2004
  • Firstpage
    253
  • Lastpage
    256
  • Abstract
    In this paper, a new feature normalization approach, based on the cumulative density function (CDF) matching principle, is proposed. Since speech features in noisy environments usually follow bimodal distributions, we fully utilize this characteristic by representing the CDF of the features with a double Gaussian model. A feature normalization process is performed according to the estimated CDF. The experimental results on the Aurora2 database show that the performance of our method is much better than that of the conventional mean and variance normalization (MVN) method, and comparable to that of the method combining spectral subtraction and histogram equalization (HE). Moreover, further improvement has been gained by combining our method with a simple temporal feature smoothing process. This result suggests that our new method has the potential to be integrated with other techniques to provide even better performance.
  • Keywords
    Gaussian distribution; higher order statistics; signal denoising; smoothing methods; speech recognition; CDF matching principle; bimodal distribution; cumulative density function; double Gaussian based feature normalization; noisy environment speech features; robust speech recognition; temporal feature smoothing process; Density functional theory; Helium; Histograms; Parametric statistics; Robustness; Spatial databases; Speech enhancement; Speech recognition; Testing; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing, 2004 International Symposium on
  • Print_ISBN
    0-7803-8678-7
  • Type

    conf

  • DOI
    10.1109/CHINSL.2004.1409634
  • Filename
    1409634