Title : 
Parameterized VT area function inversion
         
        
            Author : 
Båvegård, Mats ; Fant, Gunnar
         
        
            Author_Institution : 
Dept. of Speech, Music & Hearing, KTH, Stockholm, Sweden
         
        
        
        
        
        
            Abstract : 
The purpose of the study is to contribute tools for the inversion of articulatory-to-acoustic relations, specifically to estimate vocal tract (VT) area-function parameters from formant frequencies. The inversion is performed in two steps. A first approximation is obtained from either a codebook or a neural net, and a final optimization is performed by iterative interpolation to find a perfect or acceptable match. The study is based on a three-parameter vocal tract model. The codebook relates each of the possible combinations of constriction location, Xc, constriction area, Ac, and the lip parameter, l0/A0, to a corresponding F1, F2, F3 pattern. The neural network output provides the same choice of possible VT states as the codebook. The input to the neural network is normally programmed in terms of formant frequencies, but other acoustic attributes can be selected or added. Present experience is limited to vocalic area functions. The present system provides a rapid conversion of formant frequency data to VT parameters and has provided promising results for short sentences.
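
The two-step inversion described in the abstract (a coarse codebook match followed by an iterative refinement) can be sketched roughly as follows. This is a minimal illustration under loud assumptions, not the authors' implementation: `toy_forward` is an invented linear stand-in for the three-parameter VT model, its coefficients and the grid values are arbitrary, and the refinement is a plain coordinate search with a shrinking step, swapped in because the abstract does not specify how the iterative interpolation works. All function names are hypothetical.

```python
import itertools


def toy_forward(xc, ac, lip):
    """Hypothetical articulatory-to-acoustic map (Xc, Ac, l0/A0) -> (F1, F2, F3).

    NOT the paper's three-parameter model: an invented, invertible linear
    function used only so the sketch can run end to end.
    """
    f1 = 300.0 + 15.0 * xc + 40.0 * ac
    f2 = 2200.0 - 90.0 * xc - 60.0 * lip
    f3 = 2500.0 + 30.0 * xc - 25.0 * ac + 20.0 * lip
    return (f1, f2, f3)


def squared_distance(formants, target):
    """Sum of squared differences between two formant triples."""
    return sum((f - t) ** 2 for f, t in zip(formants, target))


def formant_error(params, target):
    """Formant mismatch produced by a candidate parameter triple."""
    return squared_distance(toy_forward(*params), target)


def build_codebook(xc_grid, ac_grid, lip_grid):
    """Tabulate every grid combination with its formant pattern."""
    grid = itertools.product(xc_grid, ac_grid, lip_grid)
    return [(params, toy_forward(*params)) for params in grid]


def codebook_lookup(codebook, target):
    """Step 1: return the parameters of the closest codebook entry."""
    best_entry = min(codebook, key=lambda e: squared_distance(e[1], target))
    return best_entry[0]


def refine(params, target, step=0.5, tol=1.0, max_iter=200):
    """Step 2 (assumed variant): coordinate search with a shrinking step.

    A trial is accepted only if it lowers the error, so the returned error
    is never worse than that of the starting point. No parameter bounds
    are enforced in this sketch.
    """
    best = tuple(params)
    best_err = formant_error(best, target)
    for _ in range(max_iter):
        if best_err <= tol:
            break  # acceptable match reached
        improved = False
        for i in range(3):
            for delta in (step, -step):
                trial = list(best)
                trial[i] += delta
                err = formant_error(trial, target)
                if err < best_err:
                    best, best_err, improved = tuple(trial), err, True
        if not improved:
            step *= 0.5  # no neighbour helps: search on a finer scale
    return best, best_err


def invert(codebook, target):
    """Run both steps; return the coarse start and the refined result."""
    start = codebook_lookup(codebook, target)
    return start, refine(start, target)


if __name__ == "__main__":
    codebook = build_codebook([2, 6, 10, 14], [0.5, 2, 4], [0.5, 1.5, 3])
    target = toy_forward(7.3, 1.2, 2.0)  # synthetic "measured" formants
    start, (estimate, err) = invert(codebook, target)
    print("codebook start:", start)
    print("refined estimate:", estimate, "squared error:", err)
```

In this sketch the codebook only needs to land in the right neighbourhood; the refinement then lowers the formant mismatch from that starting point. A neural net could replace `codebook_lookup` as the source of the first approximation without changing the second step.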
         
        
            Keywords : 
interpolation; iterative methods; neural nets; optimisation; speech coding; acceptable match; acoustic attributes; acoustic relations; articulatory relation inversion; codebook; constriction area; constriction location; formant frequencies; formant frequency data conversion; iterative interpolation; lip parameter; neural net; optimization; parameterized vocal tract area function inversion; perfect match; short sentences; three-parameter vocal tract model; vocal tract area-function parameters; vocalic area functions; Acoustics; Auditory system; Frequency conversion; Frequency estimation; Interpolation; Larynx; Music; Neural networks; Shape control; Speech;
         
        
        
        
            Conference_Title : 
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
         
        
            Conference_Location : 
Philadelphia, PA
         
        
            Print_ISBN : 
0-7803-3555-4
         
        
        
            DOI : 
10.1109/ICSLP.1996.607762