Title :
Single-Microphone LP Residual Skewness-Based Inverse Filtering of the Room Impulse Response
Author :
Mosayyebpour, Saeed ; Sheikhzadeh, Hamid ; Gulliver, T. Aaron ; Esmaeili, Morteza
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Victoria, Victoria, BC, Canada
fDate :
7/1/2012 12:00:00 AM
Abstract :
This paper presents a method based on higher order statistics (HOS), namely the normalized third-order moment (skewness), for blind estimation of the inverse filter of the room impulse response (RIR). Skewness is used as a measure of asymmetry, and a comprehensive comparison with the commonly used metric (kurtosis) is presented. It is shown that a sufficiently long linear predictive (LP) residual of the speech signal has an asymmetric pdf with sufficient skewness to be used as a score function for the HOS-based approach. The proposed algorithm is optimized for the inverse filter estimation problem. This optimization includes an efficient initialization for high reverberation intensities, enabling the method to be employed in highly reverberant rooms. The direct-to-reverberation ratio (DRR) as well as the equalized impulse response clearly show that our method can estimate the inverse filter even in highly reverberant environments. In addition, performance results using recorded background noise and in time-varying environments illustrate that our approach is applicable in real world situations. The proposed method is shown to be superior to the method by Wu and Wang, particularly in terms of reducing the coloration effect. Experiments under different acoustic conditions confirm the effectiveness of the proposed method for time delay estimation (TDE). Finally, the proposed algorithm is used as the first-stage of monaural segregation, and it is shown to improve the performance under different conditions.
Keywords :
delays; filtering theory; higher order statistics; microphones; reverberation; speech processing; transient response; background noise; blind estimation; coloration effect; direct-to-reverberation ratio; equalized impulse response; inverse filter estimation; linear predictive residual; monaural segregation; normalized third-order moment; reverberation intensity; room impulse response; score function; single-microphone LP residual skewness; speech signal; time delay estimation; time-varying environments; Channel estimation; Estimation; Higher order statistics; Probability density function; Reverberation; Speech; Speech processing; Higher order statistics (HOS); inverse filtering; linear prediction (LP) residual; room impulse response (RIR); single-microphone; skewness;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2012.2186804