DocumentCode :
3161721
Title :
Multichannel speech dereverberation and separation with optimized combination of linear and non-linear filtering
Author :
Togami, Masahito ; Kawaguchi, Yohei ; Takeda, Ryu ; Obuchi, Yasunari ; Nukaga, Nobuo
Author_Institution :
Central Res. Lab., Hitachi Ltd., Kokubunji, Japan
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4057
Lastpage :
4060
Abstract :
In this paper, we propose a multichannel speech dereverberation and separation technique that remains effective even when there are multiple speakers and each speaker's transfer function is time-varying due to fluctuation of that speaker's head. For robustness against this fluctuation, the proposed method jointly optimizes linear and non-linear filtering from a probabilistic perspective, based on a probabilistic reverberant transfer-function model (PRTFM). PRTFM extends the conventional time-invariant transfer-function model to uncertain conditions, and it can also be regarded as an extension of the recently proposed blind local Gaussian modeling. Both the linear and the non-linear filtering are optimized in the minimum mean square error (MMSE) sense during parameter optimization. The proposed method is evaluated in a reverberant meeting room and is shown to be effective.
Keywords :
Gaussian processes; least mean squares methods; nonlinear filters; nonlinear programming; speech processing; transfer functions; MMSE; PRTFM; blind local Gaussian modeling; minimum mean square error; multichannel speech dereverberation technique; multichannel speech separation technique; nonlinear filtering optimization; parameter optimization; probabilistic reverberant transfer-function model; reverberant meeting room; speaker transfer function; time-invariant transfer-function model; Covariance matrix; Microphones; Probabilistic logic; Reverberation; Speech; Transfer functions; Dereverberation; Local Gaussian modeling; Multichannel Wiener filter; Speech separation;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288809
Filename :
6288809