• DocumentCode
    2147953
  • Title

    An HNM-Based Speaker-Nonspecific Timbre Transformation Scheme for Speech Synthesis

  • Author

    Gu, Hung-Yan ; Cai, Chen-Lin ; Cai, Song-Fong

  • Author_Institution
    Nat. Taiwan Univ. of Sci. & Technol., Taipei, Taiwan
  • fYear
    2009
  • fDate
    17-19 Oct. 2009
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    In this paper, the harmonic-plus-noise model (HNM) based speech signal synthesis scheme studied previously is further extended to provide the function of speaker nonspecific timbre transformation. To transform synthetic speech´s timbre, we have developed a formant based frequency mapping method called piece-wise linear frequency mapping (PLFM). In addition, a commonly adopted method is frequency axis scaling (FAS). Both methods have been integrated into our HNM speech synthesis scheme, and a realtime synthesis system is implemented according to this scheme. The perception test results show that the proposed scheme can indeed transform the source timbre of a female adult into the timbre of a male adult, boy, or girl. In addition, the method PLFM is shown to be better than FAS for obtaining more manful timbre.
  • Keywords
    piecewise linear techniques; speech synthesis; HNM-based speaker-nonspecific timbre transformation; frequency axis scaling; harmonic-plus-noise model; piecewise linear frequency mapping; speech signal synthesis; speech synthesis; Birth disorders; Electronic mail; Frequency conversion; Hidden Markov models; Piecewise linear techniques; Signal synthesis; Signal to noise ratio; Speech synthesis; Timbre; Vocoders;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image and Signal Processing, 2009. CISP '09. 2nd International Congress on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-1-4244-4129-7
  • Electronic_ISBN
    978-1-4244-4131-0
  • Type

    conf

  • DOI
    10.1109/CISP.2009.5303818
  • Filename
    5303818