• DocumentCode
    900387
  • Title

    Waveguide physical modeling of vocal tract acoustics: flexible formant bandwidth control from increased model dimensionality

  • Author

    Mullen, Jack ; Howard, David M. ; Murphy, Damian T.

  • Author_Institution
    Media Eng. Group, York Univ., UK
  • Volume
    14
  • Issue
    3
  • fYear
    2006
  • fDate
    5/1/2006 12:00:00 AM
  • Firstpage
    964
  • Lastpage
    971
  • Abstract
    Digital waveguide physical modeling is often used as an efficient representation of acoustical resonators such as the human vocal tract. Building on the basic one-dimensional (1-D) Kelly-Lochbaum tract model, various speech synthesis techniques demonstrate improvements to the wave scattering mechanisms in order to better approximate wave propagation in the complex vocal system. Some of these techniques are discussed in this paper, with particular reference to an alternative approach in the form of a two-dimensional waveguide mesh model. Emphasis is placed on its ability to produce vowel spectra similar to that which would be present in natural speech, and how it improves upon the 1-D model. Tract area function is accommodated as model width, rather than translated into acoustic impedance, and as such offers extra control as an additional bounding limit to the model. Results show that the two-dimensional (2-D) model introduces approximately linear control over formant bandwidths leading to attainable realistic values across a range of vowels. Similarly, the 2-D model allows for application of theoretical reflection values within the tract, which when applied to the 1-D model result in small formant bandwidths, and, hence, unnatural sounding synthesized vowels.
  • Keywords
    acoustic resonators; acoustic wave scattering; acoustic waveguides; bandwidth allocation; speech synthesis; Kelly-Lochbaum tract model; acoustical resonators; digital waveguide physical modeling; flexible formant bandwidth control; two-dimensional waveguide mesh model; vocal tract acoustics; wave scattering mechanisms; Acoustic propagation; Acoustic scattering; Acoustic waveguides; Acoustic waves; Bandwidth; Human voice; Impedance; Natural languages; Speech synthesis; Waveguide components; Acoustic resonators; acoustic waveguides; speech synthesis; vocal system;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TSA.2005.858052
  • Filename
    1621208