• DocumentCode
    50342
  • Title

    Feature Denoising Using Joint Sparse Representation for In-Car Speech Recognition

  • Author

    Weifeng Li ; Yicong Zhou ; Poh, Norman ; Fei Zhou ; Qingmin Liao

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • Volume
    20
  • Issue
    7
  • fYear
    2013
  • fDate
    Jul-13
  • Firstpage
    681
  • Lastpage
    684
  • Abstract
    We address reducing the mismatch between training and testing conditions for hands-free in-car speech recognition. It is well known that the distortions caused by background noise, channel effects, etc., are highly nonlinear in the log-spectral or cepstral domain. This letter introduces a joint sparse representation (JSR) to estimate the underlying clean feature vector from a noisy feature vector. Performing a joint dictionary learning by sharing the same representation coefficients, the proposed method intends to capture the complex relationships (or mapping functions) between clean and noisy speech. Speech recognition experiments on realistic in-car data demonstrate that the proposed method shows excellent recognition performance with a relative improvement of 39.4% compared with the “baseline” frontends.
  • Keywords
    cepstral analysis; signal denoising; speech recognition; vectors; JSR; background noise; cepstral domain; channel effects; clean feature vector; feature denoising; hands-free in-car speech recognition; joint dictionary learning; joint sparse representation; log-spectral domain; mapping functions; noisy feature vector; realistic in-car data; recognition performance; relative improvement; representation coefficients; speech recognition experiments; Dictionaries; Joints; Noise measurement; Speech; Speech recognition; Testing; Training; Dictionary training; in-car speech recognition; log mel-filter bank (MFB) outputs; sparse representation;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2013.2245894
  • Filename
    6458985