An HNM-Based Speaker-Nonspecific Timbre Transformation Scheme for Speech Synthesis

Author

Gu, Hung-Yan ; Cai, Chen-Lin ; Cai, Song-Fong

Author_Institution

Nat. Taiwan Univ. of Sci. & Technol., Taipei, Taiwan

fYear

2009

fDate

17-19 Oct. 2009

Firstpage

1

Lastpage

5

Abstract

In this paper, the harmonic-plus-noise model (HNM) based speech signal synthesis scheme studied previously is extended to provide speaker-nonspecific timbre transformation. To transform the timbre of synthetic speech, we have developed a formant-based frequency mapping method called piecewise linear frequency mapping (PLFM). A commonly adopted alternative is frequency axis scaling (FAS). Both methods have been integrated into our HNM speech synthesis scheme, and a real-time synthesis system has been implemented according to this scheme. The perception test results show that the proposed scheme can transform the source timbre of a female adult into the timbre of a male adult, boy, or girl. In addition, PLFM is shown to be better than FAS at producing a more masculine timbre.

Keywords

piecewise linear techniques; speech synthesis; HNM-based speaker-nonspecific timbre transformation; frequency axis scaling; harmonic-plus-noise model; piecewise linear frequency mapping; speech signal synthesis; Birth disorders; Electronic mail; Frequency conversion; Hidden Markov models; Piecewise linear techniques; Signal synthesis; Signal to noise ratio; Speech synthesis; Timbre; Vocoders

fLanguage

English

Publisher

IEEE

Conference_Titel

Image and Signal Processing, 2009. CISP '09. 2nd International Congress on

Conference_Location

Tianjin

Print_ISBN

978-1-4244-4129-7

Electronic_ISBN

978-1-4244-4131-0

Type

conf

DOI

10.1109/CISP.2009.5303818

Filename

5303818