DocumentCode
730664
Title
Joint optimization of anatomical and gestural parameters in a physical vocal tract model
Author
Liberatore, Christopher ; Gutierrez-Osuna, Ricardo
Author_Institution
Dept. of Comput. Sci. & Eng., Texas A&M Univ., College Station, TX, USA
fYear
2015
fDate
19-24 April 2015
Firstpage
4250
Lastpage
4254
Abstract
We describe a method for adapting a physical vocal tract model´s anatomical and gestural parameters using acoustic information to match a target speaker. Physical vocal tract models are hard to adjust to match a speaker, as doing so requires information which is difficult to capture, such as X-Ray or MRI information. We propose an analysis-by-synthesis approach to adjust the parameters of the VocalTractLab (VTL) physical vocal tract model, optimizing on an acoustic distance objective function. We compare our method with one which does not adjust anatomy parameters, just gestural parameters, and find that the proposed method results in a net improvement. We also test our method´s ability to recreate a synthetic speaker for which the ground truth parameters are known, and find that the method can reproduce the speaker if parameters pertaining to teeth and lips are fixed.
Keywords
optimisation; speech; speech synthesis; MRI information; VTL; VocalTractLab; X-Ray information; acoustic distance objective function; acoustic information; analysis-by-synthesis approach; anatomical parameters; gestural parameters; ground truth parameters; lips; optimization; physical vocal tract model; target speaker matching; teeth; Acoustics; Adaptation models; Computational modeling; Linear programming; Optimization; Shape; Speech; gestures; optimization; speaker inversion; vocal tract model;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178772
Filename
7178772
Link To Document