• DocumentCode
    179569
  • Title

    Fusion of multiple uncertainty estimators and propagators for noise robust ASR

  • Author

    Tran, D.T. ; Vincent, Emmanuel ; Jouvet, Denis

  • Author_Institution
    Inria, Villers-lès-Nancy, France
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    5512
  • Lastpage
    5516
  • Abstract
    Uncertainty decoding has been successfully used for speech recognition in highly nonstationary noise environments. Yet, accurate estimation of the uncertainty on the denoised signals and propagation to the features remain difficult. In this work, we propose to fuse the uncertainty estimates obtained from different uncertainty estimators and propagators by linear combination. The fusion coefficients are optimized by minimizing a measure of divergence with oracle estimates on development data. Using the Kullback-Leibler divergence, we obtain 18% relative error rate reduction on the 2nd CHiME Challenge with respect to conventional decoding, that is about twice as much as the reduction achieved by the best single uncertainty estimator and propagator.
  • Keywords
    acoustic noise; decoding; signal denoising; speech recognition; 2nd CHiME challenge; Kullback-Leibler divergence; automatic speech recognition; denoised signals; error rate reduction; fusion coefficients; linear combination; multiple uncertainty estimators; noise robust ASR; nonstationary noise environments; uncertainty decoding; uncertainty propagators; Covariance matrices; Decoding; Speech; Speech enhancement; Speech recognition; Uncertainty; Vectors; Noise robust ASR; uncertainty handling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854657
  • Filename
    6854657