DocumentCode
20006
Title
Modeling the Vocal Tract Transfer Function Using a 3D Digital Waveguide Mesh
Author
Speed, Matt ; Murphy, Damian ; Howard, David
Author_Institution
Dept. of Electron., Univ. of York, Heslington, UK
Volume
22
Issue
2
fYear
2014
fDate
Feb. 2014
Firstpage
453
Lastpage
464
Abstract
The digital waveguide mesh has been shown to be capable of reproducing the acoustic impulse response of cylindrical vocal tract analogs. This study extends the same methodology to three-dimensional simulation of the acoustic response of graphical models of the vocal tract obtained from magnetic resonance imaging for a group of trained subjects. By such simulation of the vocal tract transfer function and convolution with an appropriate source waveform, basic phonemes are resynthesized and compared with benchmark audio recordings. The technologies and techniques used for simulation are described, alongside the protocol for image capture and the process for collection of benchmark audio. The results of simulation and acoustic recording are then evaluated and compared. The value of three-dimensional simulation in comparison to existing lower-dimensionality equivalents is assessed. It is found that while three-dimensional simulation provides a strong representation of the low frequency vocal tract transfer function, at higher frequencies its performance becomes geometry-dependent. MRI imaging and benchmark audio is provided for future studies and to permit comparison with comparable means of acoustic simulation.
Keywords
acoustic signal processing; transfer functions; 3D digital waveguide mesh; MRI imaging; acoustic impulse response; acoustic recording; basic phonemes; benchmark audio collection; benchmark audio recordings; cylindrical vocal tract analogs; graphical models; image capture; low-frequency vocal tract transfer function representation; lower-dimensionality equivalents; magnetic resonance imaging; source waveform; three-dimensional simulation; vocal tract transfer function modeling; vocal tract transfer function simulation; Acoustic waveguides; Magnetic resonance imaging; Numerical models; Solid modeling; Transfer functions; Signal processing; acoustic signal processing; human voice; speech synthesis;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher
ieee
ISSN
2329-9290
Type
jour
DOI
10.1109/TASLP.2013.2294579
Filename
6680751
Link To Document