DocumentCode :
3073615
Title :
Optimal estimators for spectral restoration of noisy speech
Author :
Porter, Jack E. ; Boll, Steven F.
Author_Institution :
ITT Defense Communications Division, San Diego, California
Volume :
9
fYear :
1984
fDate :
30742
Firstpage :
53
Lastpage :
56
Abstract :
Acoustic noise suppression is treated as a problem of finding the minimum mean square error estimate of the speech spectrum from a noisy version. This estimate equals the expected value of its conditional distribution given the noisy spectral value, the mean noise power and the mean speech power. It is shown that speech is not Gaussian. This results in an optimal estimate which is a non-linear function of the spectral magnitude. This function differs from the Wiener filter, especially at high instantaneous signal-to-noise ratios. Since both speech and Gaussian noise have a uniform phase distribution, the optimal estimator of the phase equals the noisy phase. The paper describes how the estimator can be calculated directly from noise-free speech. It describes how to find the optimal estimator for the complex spectrum, the magnitude, the squared magnitude, the log magnitude, and the root-magnitude spectra. Results for a speaker dependent connected digit speech recognition task with a base error rate of 1.6%, show that preprocessing the noisy unknown speech with a 10 dB signal-to-noise ratio reduces the error rate from 42% to 10%. If the template data are also preprocessed in the same way, the error rate reduces to 2.1%, thus recovering 99% of the recognition performance lost due to noise.
Keywords :
Acoustic noise; Error analysis; Gaussian noise; Mean square error methods; Noise reduction; Phase estimation; Phase noise; Signal to noise ratio; Speech enhancement; Wiener filter;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '84.
Type :
conf
DOI :
10.1109/ICASSP.1984.1172545
Filename :
1172545
Link To Document :
بازگشت