• DocumentCode
    2963542
  • Title

    Adaptive model-based technique for robust speech recognition

  • Author

    Graciarena, Martin

  • Author_Institution
    Inst. de Ingenieria Biomedica, Buenos Aires Univ., Argentina
  • Volume
    2
  • fYear
    2000
  • fDate
    Oct. 29 2000-Nov. 1 2000
  • Firstpage
    1512
  • Abstract
    This paper presents an algorithm for maximum likelihood (ML) estimation of the noise hidden Markov model (HMM) parameters in the context of model-based techniques. It is applied when only the clean speech HMM and noisy speech adaptation data are available. The noisy speech model is a generalization of Rose\´s (Rose et al. 1994) integrated parametric model to the Gaussian mixture HMM formulation. Observations from clean speech HMM and noise HMM models are combined in the log spectra domain, through a corruption function, to generate noisy speech observations. The estimation algorithm uses the "max" approximation as the corruption function. Noisy digit recognition experiments, with NOISEX-92, show that the same performance is achieved between the proposed model using either the estimated noise model from a single noisy utterance or a noise model calculated from silent sections of several utterances.
  • Keywords
    Gaussian noise; adaptive estimation; hidden Markov models; maximum likelihood estimation; speech recognition; Gaussian mixture HMM formulation; HMM parameters; ML estimation; NOISEX-92; Rose´s integrated parametric model; adaptive model-based technique; clean speech HMM; corruption function; log spectra domain; max approximation; maximum likelihood estimation; model-based techniques; noise hidden Markov model parameters; noisy digit recognition experiments; noisy speech adaptation data; noisy speech observation; noisy utterance; performance; robust speech recognition; silent sections; Additive noise; Context modeling; Gaussian noise; Hidden Markov models; Maximum likelihood estimation; Noise generators; Noise robustness; Speech enhancement; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signals, Systems and Computers, 2000. Conference Record of the Thirty-Fourth Asilomar Conference on
  • Conference_Location
    Pacific Grove, CA, USA
  • ISSN
    1058-6393
  • Print_ISBN
    0-7803-6514-3
  • Type

    conf

  • DOI
    10.1109/ACSSC.2000.911243
  • Filename
    911243