• DocumentCode
    3517755
  • Title

    A multistage approach for blind separation of convolutive speech mixtures

  • Author

    Jan, Tariqullah ; Wang, Wenwu ; Wang, DeLiang

  • Author_Institution
    Centre for Vision, Speech & Signal Process., Univ. of Surrey, Guildford
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    1713
  • Lastpage
    1716
  • Abstract
    In this paper, we propose a novel algorithm for the separation of convolutive speech mixtures using two-microphone recordings, based on the combination of independent component analysis (ICA) and ideal binary mask (IBM), together with a post-filtering process in the cepstral domain. Essentially, the proposed algorithm consists of three steps. First, a constrained convolutive ICA algorithm is applied to separate the source signals from two-microphone recordings. In the second step, we estimate the IBM by comparing the energy of corresponding time-frequency (T-F) units from the separated sources obtained with the convolutive ICA algorithm. The last step is to reduce musical noise caused typically by T-F masking using cepstral smoothing. The performance of the proposed approach is evaluated based on both reverberant mixtures generated using a simulated room model and real recordings. The proposed algorithm offers considerably higher efficiency, together with improved speech quality while producing similar separation performance as compared with a recent approach.
  • Keywords
    blind source separation; independent component analysis; smoothing methods; speech processing; T-F masking; blind separation; cepstral smoothing; convolutive speech mixtures; ideal binary mask; independent component analysis; multistage approach; musical noise; post-filtering process; two-microphone recordings; Acoustic noise; Cepstral analysis; Independent component analysis; Interference; Mathematical model; Signal processing algorithms; Smoothing methods; Speech analysis; Speech coding; Speech processing; Independent component analysis (ICA); cepstral smoothing; estimated binary mask; ideal binary mask (IBM); musical noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4959933
  • Filename
    4959933