DocumentCode :
1252093
Title :
Formant estimation for speech recognition
Author :
Welling, Lutz ; Ney, Hermann
Author_Institution :
Dept. of Comput. Sci., Aachen Univ. of Technol., Germany
Volume :
6
Issue :
1
Year :
1998
Date :
1/1/1998
Firstpage :
36
Lastpage :
48
Abstract :
This paper presents a new method for estimating formant frequencies. The formant model is based on a digital resonator. Each resonator represents a segment of the short-time power spectrum. The complete spectrum is modeled by a set of digital resonators connected in parallel. An algorithm based on dynamic programming produces both the model parameters and the segment boundaries that optimally match the spectrum. We used this method in experimental tests that were carried out on the TI digit string data base. The main results of the experimental tests are: (1) the presented approach produces reliable estimates of formant frequencies across a wide range of sounds and speakers; and (2) the estimated formant frequencies were used in a number of variants for recognition. The best set-up resulted in a string error rate of 4.2% on the adult corpus of the TI digit string data base.
Keywords :
dynamic programming; frequency estimation; resonators; spectral analysis; speech processing; speech recognition; TI digit string data base; adult corpus; algorithm; digital resonators; experimental tests; formant frequencies estimation; formant model; model parameters; segment boundaries; short-time power spectrum; sounds; speakers; string error rate; Acoustic testing; Context modeling; Error analysis; Heuristic algorithms; Linear predictive coding; Loudspeakers; Speech analysis
Language :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.650308
Filename :
650308
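Illustration :
The abstract describes a dynamic programming search that chooses segment boundaries in the short-time power spectrum together with one model per segment. The following is only a minimal sketch of that boundary search, not the authors' algorithm: the paper fits a digital resonator to each segment, whereas this sketch swaps in a simpler stand-in cost (the power-weighted spread of frequency bins around the segment centroid) and reports the centroid as a crude per-segment frequency estimate. The function name `segment_spectrum` and all of its parameters are hypothetical.

```python
from itertools import accumulate


def segment_spectrum(power, n_segments):
    """Split `power` (one value per frequency bin) into contiguous segments.

    Returns (boundaries, centroids). `boundaries` holds n_segments + 1 bin
    indices; segment k covers bins boundaries[k] .. boundaries[k + 1] - 1.
    `centroids` holds the power-weighted mean bin index of each segment.
    """
    n = len(power)
    if not 1 <= n_segments <= n:
        raise ValueError("need 1 <= n_segments <= len(power)")

    # Prefix sums of power, power * bin and power * bin^2, so that the cost
    # of any candidate segment can be evaluated in constant time.
    w = [0.0] + list(accumulate(float(p) for p in power))
    m1 = [0.0] + list(accumulate(float(p) * f for f, p in enumerate(power)))
    m2 = [0.0] + list(accumulate(float(p) * f * f for f, p in enumerate(power)))

    def cost(i, j):
        # Stand-in cost (an assumption of this sketch, not the resonator
        # fit of the paper): power-weighted squared deviation of the bins
        # i .. j - 1 from their power-weighted centroid.
        weight = w[j] - w[i]
        if weight <= 0.0:
            return 0.0
        first = m1[j] - m1[i]
        return max(m2[j] - m2[i] - first * first / weight, 0.0)

    inf = float("inf")
    # best[k][j]: minimal total cost of covering bins 0 .. j - 1 with k segments.
    best = [[inf] * (n + 1) for _ in range(n_segments + 1)]
    back = [[0] * (n + 1) for _ in range(n_segments + 1)]
    best[0][0] = 0.0
    for k in range(1, n_segments + 1):
        for j in range(k, n + 1):
            for i in range(k - 1, j):
                candidate = best[k - 1][i] + cost(i, j)
                if candidate < best[k][j]:
                    best[k][j] = candidate
                    back[k][j] = i

    # Trace the stored decisions back from the last bin to recover boundaries.
    boundaries = [n]
    for k in range(n_segments, 0, -1):
        boundaries.append(back[k][boundaries[-1]])
    boundaries.reverse()

    centroids = []
    for i, j in zip(boundaries, boundaries[1:]):
        weight = w[j] - w[i]
        # Fall back to the segment midpoint if the segment carries no power.
        centroids.append((m1[j] - m1[i]) / weight if weight > 0 else (i + j - 1) / 2)
    return boundaries, centroids
```

For a toy spectrum with three isolated peaks, calling `segment_spectrum(power, 3)` places one peak in each segment. The triple loop costs on the order of K·N² cost evaluations for N bins and K segments, which is why the per-segment cost is computed from prefix sums rather than by rescanning the bins.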