• DocumentCode
    20006
  • Title

    Modeling the Vocal Tract Transfer Function Using a 3D Digital Waveguide Mesh

  • Author

    Speed, Matt ; Murphy, Damian ; Howard, David

  • Author_Institution
    Dept. of Electron., Univ. of York, Heslington, UK
  • Volume
    22
  • Issue
    2
  • fYear
    2014
  • fDate
    Feb. 2014
  • Firstpage
    453
  • Lastpage
    464
  • Abstract
    The digital waveguide mesh has been shown to be capable of reproducing the acoustic impulse response of cylindrical vocal tract analogs. This study extends the same methodology to three-dimensional simulation of the acoustic response of graphical models of the vocal tract obtained from magnetic resonance imaging for a group of trained subjects. By such simulation of the vocal tract transfer function and convolution with an appropriate source waveform, basic phonemes are resynthesized and compared with benchmark audio recordings. The technologies and techniques used for simulation are described, alongside the protocol for image capture and the process for collection of benchmark audio. The results of simulation and acoustic recording are then evaluated and compared. The value of three-dimensional simulation in comparison to existing lower-dimensionality equivalents is assessed. It is found that while three-dimensional simulation provides a strong representation of the low frequency vocal tract transfer function, at higher frequencies its performance becomes geometry-dependent. MRI imaging and benchmark audio is provided for future studies and to permit comparison with comparable means of acoustic simulation.
  • Keywords
    acoustic signal processing; transfer functions; 3D digital waveguide mesh; MRI imaging; acoustic impulse response; acoustic recording; basic phonemes; benchmark audio collection; benchmark audio recordings; cylindrical vocal tract analogs; graphical models; image capture; low-frequency vocal tract transfer function representation; lower-dimensionality equivalents; magnetic resonance imaging; source waveform; three-dimensional simulation; vocal tract transfer function modeling; vocal tract transfer function simulation; Acoustic waveguides; Magnetic resonance imaging; Numerical models; Solid modeling; Transfer functions; Signal processing; acoustic signal processing; human voice; speech synthesis;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2013.2294579
  • Filename
    6680751