• DocumentCode
    768908
  • Title

    Cepstrum-domain acoustic feature compensation based on decomposition of speech and noise for ASR in noisy environments

  • Author

    Kim, Hong Kook ; Rose, Richard C.

  • Author_Institution
    Dept. of Inf. & Commun., Kwangju Inst. of Sci. & Technol., South Korea
  • Volume
    11
  • Issue
    5
  • fYear
    2003
  • Firstpage
    435
  • Lastpage
    446
  • Abstract
    This paper presents a set of acoustic feature pre-processing techniques that are applied to improving automatic speech recognition (ASR) performance on noisy speech recognition tasks. The principal contribution of this paper is an approach for cepstrum-domain feature compensation in ASR which is motivated by techniques for decomposing speech and noise that were originally developed for noisy speech enhancement. This approach is applied in combination with other feature compensation algorithms to compensating ASR features obtained from a mel-filterbank cepstrum coefficient front-end. Performance comparisons are made with respect to the application of the minimum mean squared error log spectral amplitude (MMSE-LSA) estimator based speech enhancement algorithm prior to feature analysis. An experimental study is presented where the feature compensation approaches described in the paper are found to greatly reduce ASR word error rate compared to uncompensated features under environmental and channel mismatched conditions.
  • Keywords
    acoustic noise; acoustic signal processing; cepstral analysis; channel bank filters; least mean squares methods; parameter estimation; speech enhancement; speech recognition; ASR performance; MMSE log spectral amplitude estimator; MMSE-LSA estimator; acoustic feature preprocessing techniques; automatic speech recognition; cepstrum-domain acoustic feature compensation; channel mismatched conditions; environmental conditions; feature compensation algorithms; mel-filterbank cepstrum coefficient front-end; minimum mean squared error; noise decomposition; noisy environments; noisy speech enhancement; noisy speech recognition; speech decomposition; word error rate reduction; Acoustic noise; Amplitude estimation; Automatic speech recognition; Cepstral analysis; Cepstrum; Performance analysis; Speech analysis; Speech enhancement; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2003.815515
  • Filename
    1223593