• DocumentCode
    417291
  • Title

    Joint removal of additive and convolutional noise with model-based feature enhancement

  • Author

    Stouten, Veronique ; Van hamme, Hugo ; Wambacq, Patrick

  • Author_Institution
    ESAT, Katholieke Univ., Leuven, Heverlee, Belgium
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    In this paper we describe how we successfully extended the model-based feature enhancement (MBFE) algorithm to jointly remove additive and convolutional noise from corrupted speech. Although a model of the clean speech can incorporate prior knowledge into the feature enhancement process, this model no longer yields an accurate fit if a different microphone is used. To cure the resulting performance degradation, we merge a new iterative EM algorithm to estimate the channel, and the MBFE-algorithm to remove nonstationary additive noise. In the latter, the parameters of a shifted clean speech HMM and a noise HMM are first combined by a vector Taylor series approximation and then the state-conditional MMSE-estimates of the clean speech are calculated. Recognition experiments confirmed the superior performance on the Aurora4 recognition task. An average relative reduction in WER of 12% and 2.8% on the clean and multi condition training respectively, was obtained compared to the Advanced Front-End standard.
  • Keywords
    channel estimation; error statistics; hidden Markov models; iterative methods; least mean squares methods; optimisation; series (mathematics); speech recognition; state estimation; Aurora4 recognition task; MBFE algorithm; WER reduction; additive convolutional noise removal; automatic speech recognition; channel estimate; corrupted speech; iterative EM algorithm; model-based feature enhancement; noise HMM; nonstationary additive noise; performance; shifted clean speech HMM; state-conditional MMSE-estimates; vector Taylor series approximation; Acoustic distortion; Acoustic noise; Additive noise; Cepstral analysis; Degradation; Hidden Markov models; Microphones; Speech enhancement; Taylor series; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326144
  • Filename
    1326144