• DocumentCode
    3340212
  • Title

    A novel syllable duration modeling approach for Mandarin speech

  • Author

    Lai, Wen-Hsing ; Chen, Sin-Horng

  • Author_Institution
    Dept. of Commun. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
  • Volume
    1
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    93
  • Abstract
    In this paper, a novel syllable duration modeling approach for Mandarin speech is proposed. It explicitly takes several main affecting factors as multiplicative companding parameters and estimates all model parameters by an EM algorithm. Experimental results show that the variance of the observed syllable duration is greatly reduced from 183.4 frame2 (1 frame=5 ms) to 18.5 frame2 by eliminating effects from these affecting factors. Besides, the estimated companding values of these affecting factors agree well with our prior linguistic knowledge. A preliminary study of applying the proposed model to predict syllable duration for TTS is also performed. Experimental results show that it outperforms the conventional regressive prediction method. Lastly, an extension of the approach to incorporate initial and final duration modeling is presented. This leads to a better understanding of the relation between the companding factors of initial and final duration models and those of syllable duration model
  • Keywords
    iterative methods; natural languages; parameter estimation; speech synthesis; EM algorithm; Mandarin speech; TTS; companding factors; expectation maximization algorithm; final duration modeling; initial duration modeling; multiplicative companding parameters; prosody; syllable duration modeling; Automatic speech recognition; Frequency; Hidden Markov models; Laboratories; Natural languages; Parameter estimation; Prediction methods; Predictive models; Speech synthesis; Timing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7041-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2001.940775
  • Filename
    940775