DocumentCode
290032
Title
Determination of human vocal-tract dynamic geometry from formant trajectories using spatial and temporal Fourier analysis
Author
Yehia, H. ; Itakura, Fumitada
Author_Institution
Sch. of Eng., Nagoya Univ., Japan
Volume
i
fYear
1994
fDate
19-22 Apr 1994
Abstract
This article presents a method for estimating the vocal-tract cross-sectional area as a function of time and of position along the tract length. The estimation is based on the speech formant frequencies and uses a priori information about natural tract configurations. The method proceeds in three steps. First, the cross-sectional area is represented by a two-dimensional Fourier cosine series expansion in time and space. Then, the locally linear relationship between the spatial Fourier coefficients and the formant frequencies is used to formulate an acoustical constraint in the coefficient space. Finally, the sequence of vocal-tract areas corresponding to a given sequence of formants is estimated under positional, dynamical, and acoustical constraints. The system behavior is shown first for the static case of vowels and then for the dynamic case of vowel-to-vowel transitions. The method can serve as a bridge between articulatory parameter models and the speech parameter space. Moreover, it is potentially useful for area-driven coders and synthesizers.
Keywords
Fourier analysis; speech coding; speech processing; speech synthesis; acoustical constraint; area driven coders; area driven synthesizers; articulatory parameter models; coefficient space; dynamical constraints; formant trajectories; human vocal-tract dynamic geometry; natural tract configurations; positional constraints; spatial Fourier analysis; spatial Fourier coefficients; speech acoustical parameters; speech formant frequencies; speech parameter space; temporal Fourier analysis; two-dimensional Fourier cosine series expansion; vocal tract length; vocal-tract cross-sectional area; vowel-to-vowel transitions; vowels; Art; Costs; Data mining; Humans; Information geometry; Speech processing; Speech synthesis; Synthesizers; Time factors
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location
Adelaide, SA
ISSN
1520-6149
Print_ISBN
0-7803-1775-0
Type
conf
DOI
10.1109/ICASSP.1994.389252
Filename
389252