DocumentCode
3390699
Title
Improvement of Speech Source Localization in Noisy Environment Using Overcomplete Rational-Dilation Wavelet Transforms
Author
Di Liu ; Khong, Andy W H
Author_Institution
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear
2010
fDate
20-22 Oct. 2010
Firstpage
77
Lastpage
81
Abstract
The generalized cross-correlation using the phase transform prefilter remains popular for the estimation of time-differences-of-arrival. However it is not robust to noise and as a consequence, the performance of direction-of-arrival algorithms is often degraded under low signal-to-noise condition. We propose to address this problem through the use of a wavelet-based speech enhancement technique since the wavelet transform can achieve good denoising performance. The over complete rational-dilation wavelet transform is then exploited to effectively process speech signals due to its higher frequency resolution. In addition, we exploit the joint distribution of the speech in the wavelet domain and develop a novel local noise variance estimator based on the bivariate shrinkage function. As will be shown, our proposed algorithm achieves good direction-of-arrival performance in the presence of noise.
Keywords
acoustic signal detection; direction-of-arrival estimation; speech enhancement; wavelet transforms; bivariate shrinkage function; denoising performance; direction of arrival algorithm; frequency resolution; local noise variance estimator; noisy environment; phase transform prefilter; rational dilation wavelet transform; signal to noise condition; speech distribution; speech signal processing; speech source localization; wavelet based speech enhancement technique; Direction of arrival estimation; Estimation; Noise reduction; Signal to noise ratio; Speech; Wavelet transforms; DOA estimation; denoising; speech source localization; wavelet;
fLanguage
English
Publisher
ieee
Conference_Titel
Cyberworlds (CW), 2010 International Conference on
Conference_Location
Singapore
Print_ISBN
978-1-4244-8301-3
Electronic_ISBN
978-0-7695-4215-7
Type
conf
DOI
10.1109/CW.2010.69
Filename
5655062
Link To Document