DocumentCode
2147953
Title
An HNM-Based Speaker-Nonspecific Timbre Transformation Scheme for Speech Synthesis
Author
Gu, Hung-Yan ; Cai, Chen-Lin ; Cai, Song-Fong
Author_Institution
Nat. Taiwan Univ. of Sci. & Technol., Taipei, Taiwan
fYear
2009
fDate
17-19 Oct. 2009
Firstpage
1
Lastpage
5
Abstract
In this paper, the harmonic-plus-noise model (HNM) based speech signal synthesis scheme studied previously is further extended to provide the function of speaker nonspecific timbre transformation. To transform synthetic speech´s timbre, we have developed a formant based frequency mapping method called piece-wise linear frequency mapping (PLFM). In addition, a commonly adopted method is frequency axis scaling (FAS). Both methods have been integrated into our HNM speech synthesis scheme, and a realtime synthesis system is implemented according to this scheme. The perception test results show that the proposed scheme can indeed transform the source timbre of a female adult into the timbre of a male adult, boy, or girl. In addition, the method PLFM is shown to be better than FAS for obtaining more manful timbre.
Keywords
piecewise linear techniques; speech synthesis; HNM-based speaker-nonspecific timbre transformation; frequency axis scaling; harmonic-plus-noise model; piecewise linear frequency mapping; speech signal synthesis; speech synthesis; Birth disorders; Electronic mail; Frequency conversion; Hidden Markov models; Piecewise linear techniques; Signal synthesis; Signal to noise ratio; Speech synthesis; Timbre; Vocoders;
fLanguage
English
Publisher
ieee
Conference_Titel
Image and Signal Processing, 2009. CISP '09. 2nd International Congress on
Conference_Location
Tianjin
Print_ISBN
978-1-4244-4129-7
Electronic_ISBN
978-1-4244-4131-0
Type
conf
DOI
10.1109/CISP.2009.5303818
Filename
5303818
Link To Document