• DocumentCode
    2160307
  • Title

    A study on bias-based speech signal conditioning techniques for improving the robustness of automatic speech recognition

  • Author

    Chowdhury, Md Foezur Rahman ; Selouani, Sid-Ahmed ; Shaughnessy, Douglas O.

  • Author_Institution
    INRS - EMT, Univ. du Quebec, Quebec City, QC
  • fYear
    2009
  • fDate
    3-6 May 2009
  • Firstpage
    664
  • Lastpage
    669
  • Abstract
    Automatic speech recognition (ASR) performs poorly when the training conditions greatly mismatch the testing conditions. Additive background noises and channel distortion are responsible mostly for these mismatches. These mismatches introduce highly non-linear terms in the acoustic model of speech in both log-spectral and the cepstral domains. Current ASR is based on simple linear approximation of these non-linear functions in the cepstral domain in order to avoid model complexities. This linear modeling approach transforms the channel distortion into an additive bias term in the cepstral domain under the assumption of high SNR, which is barely true in practical situations. Several algorithms have been developed to estimate this bias term and make compensations either in feature space or in the model domain to improve the robustness of ASR. In this paper, we explore these bias estimate techniques for both stationary and non-stationary acoustic environments to find their applicability for self-adaptable ASR.
  • Keywords
    distortion; nonlinear functions; speech recognition; additive background noises; automatic speech recognition; bias-based speech signal conditioning techniques; cepstral domains; channel distortion; linear approximation; log-spectral domain; nonlinear functions; Acoustic distortion; Acoustic testing; Automatic speech recognition; Automatic testing; Background noise; Cepstral analysis; Noise robustness; Nonlinear acoustics; Nonlinear distortion; Performance evaluation; Automatic speech recognition; additive noise; channel bias; joint bias removal; robust ASR; self-adaptable ASR;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical and Computer Engineering, 2009. CCECE '09. Canadian Conference on
  • Conference_Location
    St. John´s, NL
  • ISSN
    0840-7789
  • Print_ISBN
    978-1-4244-3509-8
  • Electronic_ISBN
    0840-7789
  • Type

    conf

  • DOI
    10.1109/CCECE.2009.5090212
  • Filename
    5090212