• DocumentCode
    44841
  • Title

    Multichannel Equalization in the KLT and Frequency Domains With Application to Speech Dereverberation

  • Author

    Rashobh, Rajan S. ; Khong, Andy W. H. ; Di Liu

  • Author_Institution
    Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • Volume
    22
  • Issue
    3
  • fYear
    2014
  • fDate
    Mar-14
  • Firstpage
    634
  • Lastpage
    646
  • Abstract
    Equalization of acoustic channels usually involves inversion of acoustic impulse responses (AIRs), and generally employs multichannel techniques. In this paper, we propose three equalization algorithms, one in the Karhunen-Loève transform (KLT) domain and the other two in the frequency domain. Our proposed algorithm in the KLT domain provides a platform to achieve equalization in conjunction with denoising. Existing multiple-input/output inverse theorem (MINT)-based non-adaptive algorithms require the inversion of a matrix with dimension that is proportional to the AIR length, and is computationally expensive. To overcome this limitation, we propose the frequency-domain algorithm which is computationally very efficient and thus can be employed for the equalization of high-order AIRs in practical applications. In addition, the frequency-domain method is more robust to AIR estimation errors. To achieve further reduction in the complexity without significant performance degradation, we then propose a modified version of the frequency-domain algorithm.
  • Keywords
    Karhunen-Loeve transforms; equalisers; signal denoising; speech processing; AIR estimation error; KLT; Karhunen-Loeve transform; acoustic channel equalization; acoustic impulse response; frequency domain algorithm; multichannel equalization; signal denoising; speech dereverberation; Acoustics; Complexity theory; Convergence; Estimation error; Frequency-domain analysis; Speech; Transforms; Acoustic microphone array; multichannel equalization; speech dereverberation;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2013.2297013
  • Filename
    6698384