DocumentCode :
312143
Title :
Parameterized VT area function inversion
Author :
Båvegård, Mats ; Fant, Gunnar
Author_Institution :
Dept. of Speech, Music & Hearing, KTH, Stockholm, Sweden
Volume :
2
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
961
Abstract :
The purpose of the study is to contribute tools for inversion of articulatory to acoustics relations, specifically to perform an estimate of vocal tract area-function parameters from formant frequencies. The inversion is performed in two steps. A first approximation is attained from either a codebook or a neural net and a final optimization is performed by an iterative interpolation for finding a perfect or acceptable match. The study is based on a three-parameter vocal tract model. The codebook relates each of the possible combinations of constriction location, Xc, constriction area, Ac and the lip parameter, l0/A0 to a corresponding F 1, F2, F3 pattern. The neural network output provides the same choice of possible VT states as the codebook. The input to the neural network is normally programmed in terms of formant frequencies but other acoustic attributes can be selected or added. Present experience is limited to vocalic area functions. The present system provides a rapid conversion of formant frequency data to VT parameters and has provided promising results for short sentences
Keywords :
interpolation; iterative methods; neural nets; optimisation; speech coding; acceptable match; acoustic attributes; acoustic relations; articulatory relation inversion; codebook; constriction area; constriction location; formant frequencies; formant frequency data conversion; iterative interpolation; lip parameter; neural net; optimization; parameterized vocal tract area function inversion; perfect match; short sentences; three-parameter vocal tract model; vocal tract area-function parameters; vocalic area functions; Acoustics; Auditory system; Frequency conversion; Frequency estimation; Interpolation; Larynx; Music; Neural networks; Shape control; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607762
Filename :
607762
Link To Document :
بازگشت