• DocumentCode
    730786
  • Title

    Discriminative uncertainty estimation for noise robust ASR

  • Author

    Tran, Dung T. ; Vincent, Emmanuel ; Jouvet, Denis

  • Author_Institution
    Inria, Villers-lès-Nancy, France
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    5038
  • Lastpage
    5042
  • Abstract
    We consider the problem of uncertainty estimation for noiserobust ASR. Existing uncertainty estimation techniques improve ASR accuracy but they still exhibit a gap compared to the use of oracle uncertainty. This comes partly from the highly non-linear feature transformation and from additional assumptions such as Gaussian distribution and independence between frequency bins in the spectral domain. In this paper, we propose a method to rescale the estimated feature-domain full uncertainty covariance matrix in a statedependent fashion according to a discriminative criterion. The state-dependent and feature index-dependent scaling factors are learned from development data. Experimental evaluation on Track 1 of the 2nd CHiME challenge data shows that discriminative rescaling leads to better results than generative rescaling. Moreover, discriminative rescaling of the Wiener uncertainty estimator leads to 12% relative word error rate reduction compared to discriminative rescaling of the alternative estimator in [1].
  • Keywords
    Gaussian distribution; covariance matrices; signal denoising; speech recognition; 2nd CHiME challenge data; Gaussian distribution; automatic speech recognition; discriminative uncertainty estimation problem; estimated feature-domain full uncertainty covariance matrix; feature index-dependent scaling factors; frequency bins; noise robust ASR; nonlinear feature transformation; relative word error rate reduction; spectral domain; state-dependent scaling factors; Indexes; Speech; Speech processing; Automatic speech recognition; discriminative adaptation; noise robustness; uncertainty handling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178930
  • Filename
    7178930