• DocumentCode
    2801710
  • Title

    Morphological normalization of vocal tract shape

  • Author

    Wei, Jianguo ; Dang, Jianwu

  • Author_Institution
    Japan Adv. Inst. of Sci. & Technol., Ishikawa, Japan
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    4186
  • Lastpage
    4189
  • Abstract
    The articulatory databases are not utilized so widely as acoustic databases. One of the reasons is the difficulty of reducing morphological variations among subjects. To reduce morphological differences in speech organs among speakers and remain their speech dynamics, this study proposed a framework of normalizing vocal tract by using a Thin-plate spline method. Electromagnetic Midsagittal Articulographic data for three subjects have been used in this research. The template for normalization was obtained by averaging all three subjects´ palates and tongue shapes. The landmarks of the template and subjects have been defined according to a gridline system of the vocal tract. The results show that the variances among subjects were reduced 0.8 mm in horizontal and 2.4 mm in vertical direction. The similar vowel structure of pre/post-normalization data indicates that speaker specific characteristics can be maintained by this framework. The effects of the normalization in acoustic space are also investigated by using a physiological articulatory model. Results show that the variations have also been reduced in acoustic space.
  • Keywords
    bioacoustics; biological organs; speech; acoustic databases; articulatory databases; electromagnetic midsagittal articulographic data; gridline system; morphological differences; morphological normalization; palates; physiological articulatory model; speech organs; thin-plate spline method; tongue shapes; vocal tract shape; vowel structure; Computer science; Databases; Kinematics; Loudspeakers; Magnetic resonance imaging; Shape; Speech; Spline; Surface morphology; Tongue; Articulatory data; Thin-plate spline; Vocal tract normalization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495711
  • Filename
    5495711