• DocumentCode
    417690
  • Title

    Fast convergence speech source separation in reverberant acoustic environment

  • Author

    Zhao, Yunxin ; Hu, Rong

  • Author_Institution
    Dept. of Comput. Sci., Missouri Univ., Columbia, MO, USA
  • Volume
    3
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    Three significant enhancements to time-domain adaptive decorrelation filtering (ADF) are proposed for effective separation and recognition of simultaneous speech sources in reverberant room conditions. The methods include whitening filtering on cochannel speech prior to ADF to improve condition of adaptive estimation, a novel block-iterative implementation of ADF to speed up convergence rate, and an integration of multiple ADF outputs through optimal post-filtering. Experimental data were generated by convolving TIMIT speech with acoustic path impulse responses measured in a real acoustic environment, with a 2m microphone-source distance and an initial target-to-interference ratio of about 0 dB. The proposed methods are shown to have speeded up the convergence rate of ADF to a level feasible for online applications, and they have significantly improved target-to-interference ratio and accuracy of phone recognition.
  • Keywords
    acoustic convolution; adaptive estimation; adaptive filters; blind source separation; convergence of numerical methods; decorrelation; iterative methods; optimisation; reverberation; speech recognition; transient response; ADF; TIMIT speech; acoustic path impulse responses; adaptive estimation; block-iterative implementation; cochannel speech; convergence; convolution; optimal post-filtering; phone recognition accuracy; reverberant acoustic environment; reverberant room; simultaneous speech recognition; speech source separation; target-to-interference ratio; time-domain adaptive decorrelation filtering; whitening filtering; Acoustic measurements; Adaptive estimation; Adaptive filters; Convergence; Decorrelation; Filtering; Source separation; Speech enhancement; Speech recognition; Time domain analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326690
  • Filename
    1326690