Title :
Study of the Noise-Reduction Problem in the Karhunen–LoÈve Expansion Domain
Author :
Jingdong Chen ; Benesty, J. ; Yiteng Huang
Author_Institution :
Bell Labs., Alcatel-Lucent, Murray, NJ
fDate :
5/1/2009 12:00:00 AM
Abstract :
Noise reduction, which aims at estimating a clean speech from a noisy observation, has long been an active research area. The standard approach to this problem is to obtain the clean speech estimate by linearly filtering the noisy signal. The core issue, then, becomes how to design an optimal linear filter that can significantly suppress noise without introducing perceptually noticeable speech distortion. Traditionally, the optimal noise-reduction filters are formulated in either the time or the frequency domains. This paper studies the problem in the Karhunen-Loeve expansion domain. We develop two classes of optimal filters. The first class achieves a frame of speech estimate by filtering the corresponding frame of the noisy speech. We will show that many existing methods such as the widely used Wiener filter and subspace technique are closely related to this category. The second class obtains noise reduction by filtering not only the current frame, but also a number of previous consecutive frames of the noisy speech. We will discuss how to design the optimal noise-reduction filters in each class and demonstrate, through both theoretical analysis and experiments, the properties of the deduced optimal filters.
Keywords :
Karhunen-Loeve transforms; filtering theory; speech enhancement; Karhunen-Loeve expansion domain; noise-reduction filter; optimal linear filter; speech distortion; Discrete Fourier transforms; Filtering; Frequency domain analysis; Noise reduction; Nonlinear filters; Signal processing; Signal to noise ratio; Speech enhancement; Speech processing; Wiener filter; Karhunen–LoÈve expansion (KLE); Pearson correlation coefficient; Wiener filter; maximum signal-to-noise ratio (SNR) filter; noise reduction; speech enhancement; subspace approach;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2009.2014793