DocumentCode
2813064
Title
A parametric three-dimensional model of the vocal-tract based on MRI data
Author
Yehia, Hani ; Tiede, Mark
Author_Institution
ATR Human Inf. Res. Labs., Kyoto, Japan
Volume
3
fYear
1997
fDate
21-24 Apr 1997
Firstpage
1619
Abstract
Twenty-four three-dimensional (3D) vocal-tract (VT) shapes extracted from MRI data are used to derive a parametric model of the vocal tract. The method is as follows. First, each 3D VT shape is sampled using a semi-cylindrical grid whose position is determined by reference points based on the VT anatomy. Next, the VT projections onto each plane of the grid are represented by their two main components, obtained via principal component analysis (PCA). PCA is then applied a second time to parametrize the sequences of coefficients that represent the sections along the tract. The first four components were found to explain about 90% of the total variance of the observed shapes. Following this procedure, 3D VT shapes are approximated by linear combinations of four 3D basis functions. Finally, it is shown that the four parameters of the model can be estimated from the VT midsagittal profiles.
Keywords
biomedical NMR; parameter estimation; physiological models; signal sampling; speech processing; 3D basis functions; 3D vocal tract shapes; MRI data; PCA; articulatory speech processes; coefficients; observed shape variance; parametric model; parametric three-dimensional model; principal component analysis; reference points; sampling; semicylindrical grid; vocal tract midsagittal profiles; Anatomy; Data mining; Humans; Laboratories; Light rail systems; Magnetic resonance imaging; Principal component analysis; Shape; Speech processing; Speech synthesis;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location
Munich
ISSN
1520-6149
Print_ISBN
0-8186-7919-0
Type
conf
DOI
10.1109/ICASSP.1997.598809
Filename
598809
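Note on the abstract
The two-stage PCA outlined in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the data are random stand-ins for the MRI-derived sections, the number of grid planes (30) and features per section (16) are assumed values, and pooling all planes into a single first-stage PCA is also an assumption, since the abstract does not say whether the per-plane analysis is pooled. Only the counts taken from the abstract (24 shapes, 2 components per section, 4 model parameters) come from the source. The function names are hypothetical.

```python
import numpy as np


def pca_fit(X, k):
    """Return (mean, components): the k leading principal directions as rows."""
    mean = X.mean(axis=0)
    # SVD of the centered data; rows of Vt are the principal directions.
    _, _, Vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, Vt[:k]


def two_stage_pca(sections, k_section=2, k_tract=4):
    """Fit both PCA stages and return the model plus per-shape parameters.

    sections: array (n_shapes, n_planes, n_feat), one feature vector per
    grid plane (assumed layout, not taken from the paper).
    """
    n_shapes, n_planes, n_feat = sections.shape

    # Stage 1: describe every section by k_section coefficients.
    flat = sections.reshape(-1, n_feat)
    m1, W1 = pca_fit(flat, k_section)
    coef = (flat - m1) @ W1.T

    # Stage 2: PCA on the sequence of coefficients along the tract.
    seq = coef.reshape(n_shapes, n_planes * k_section)
    m2, W2 = pca_fit(seq, k_tract)
    params = (seq - m2) @ W2.T

    return {"m1": m1, "W1": W1, "m2": m2, "W2": W2}, params


def reconstruct(model, params, n_planes):
    """Map model parameters back to approximate sections."""
    seq_hat = params @ model["W2"] + model["m2"]
    k_section = model["W1"].shape[0]
    coef_hat = seq_hat.reshape(-1, k_section)
    flat_hat = coef_hat @ model["W1"] + model["m1"]
    return flat_hat.reshape(params.shape[0], n_planes, -1)


# Synthetic stand-in data: 24 shapes, 30 planes, 16 features (assumed sizes).
rng = np.random.default_rng(0)
sections = rng.normal(size=(24, 30, 16))

model, params = two_stage_pca(sections)
approx = reconstruct(model, params, n_planes=30)
```

Each row of `params` holds the four parameters of one shape, and `reconstruct` expresses a shape as the mean plus a linear combination of four basis vectors, which is the structure the abstract describes. With random input the explained variance will be far below the roughly 90% reported for real vocal-tract data.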