• DocumentCode
    1688553
  • Title

    Joint sparse representation based cepstral-domain dereverberation for distant-talking speech recognition

  • Author

    Weifeng Li ; Longbiao Wang ; Fei Zhou ; Qingmin Liao

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Shenzhen, China
  • fYear
    2013
  • Firstpage
    7117
  • Lastpage
    7120
  • Abstract
    In this paper we address reducing the mismatch between training and testing conditions for robust distant-talking speech recognition under realistic reverberant environments. It is well known that the distortions caused by reverberation, background noise, etc., are highly nonlinear in the cepstral domain. In this paper we propose to capture the complex relationships between clean and reverberant speech via joint dictionary learning. Given a test reverberant speech with a sequence of feature vectors we first find their sparse representations, and then estimate the underlying clean feature vectors using the dictionary of clean speech. Based on speech recognition experiments conducted under realistic reverberation conditions, the proposed method is shown to perform very well, resulting in an average relative improvement of 59.1% compared with the baseline front-ends.
  • Keywords
    learning (artificial intelligence); speech recognition; cepstral domain dereverberation; clean speech dictionary; distant talking speech recognition; feature vectors; joint dictionary learning; joint sparse representation; realistic reverberant environments; Abstracts; Mel-Frequency Cepstral Coefficients (MFCCs); blind dereverberation; reverberation-robust speech recognition; sparse representation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639043
  • Filename
    6639043