Title :
Auditory modeling applied to formant tracking of noise-corrupted speech
Author :
Metz, S.W. ; Heinen, James A. ; Niederjohn, Russell J. ; Sreenivas, Thippur V.
Author_Institution :
Dept. of Electr. & Comput. Eng., Marquette Univ., Milwaukee, WI, USA
fDate :
28 Oct-1 Nov 1991
Abstract :
The authors explore the use of the EIH (ensemble interval histogram) as the basis for formant frequency estimation for noise-corrupted speech. By measuring the percent RMS (root mean square) errors of the first three formants it was shown that the EIH is effective for formant frequency estimation. The authors also show the effects that some of the model parameters have on formant tracking and conclude that the uniform IIR (infinite impulse response) filter bank with eight linear level-crossings is best suited for formant tracking of noise-corrupted speech. The minimal influence of these parameter changes indicates the robustness of the auditory model. Results obtained provide strong evidence that formant frequency estimation based on the EIH may be more accurate for noise-corrupted speech than results achieved with traditional methods
Keywords :
filtering and prediction theory; hearing; physiological models; spectral analysis; speech analysis and processing; IIR filter bank; RMS errors; auditory model; ensemble interval histogram; formant frequency estimation; formant tracking; noise corrupted speech tracking; physiological model; speech processing; Auditory system; Ear; Frequency estimation; Humans; Psychoacoustic models; Speech analysis; Speech enhancement; Speech processing; Speech recognition; Working environment noise;
Conference_Titel :
Industrial Electronics, Control and Instrumentation, 1991. Proceedings. IECON '91., 1991 International Conference on
Conference_Location :
Kobe
Print_ISBN :
0-87942-688-8
DOI :
10.1109/IECON.1991.239014