مرکز منطقه ای اطلاع رساني علوم و فناوري - Representing fundamental frequency contours generated by HMM-based speech synthesis using generation process model

DocumentCode :

2132493

Title :

Representing fundamental frequency contours generated by HMM-based speech synthesis using generation process model

Author :

Hirose, Keikichi ; Matsuda, Tatsuya ; Hashimoto, Hiroya ; Minematsu, Nobuaki

Author_Institution :

Dept. of Inf. & Commun. Eng., Univ. of Tokyo, Tokyo, Japan

fYear :

2011

fDate :

18-21 Sept. 2011

Firstpage :

Lastpage :

Abstract :

Frame-by-frame representation is not appropriate for prosodic features, which are tightly related to speech units spreading a wide time span, such as words, phrases and so on. This causes an inherit problem in fundamental frequency (F₀) contour generation by HMM-based speech synthesis. A method is developed to modify F₀ contours in the framework of a generation process model by referring to linguistic information of input text (word boundary and accent type). It takes F₀ variances obtained through HMM-based speech synthesis into account during the process. Through a listening experiment on synthetic speech, the method is proved to generate better quality as compared to the HMM-based speech synthesis on average. Since the generation process model can clearly relate its commands and linguistic (and para-/non- linguistic) information, the method has an additional advantage; changing speech styles, and /or adding further information (such as emphasis) can be easily done through manipulating the commands.

Keywords :

hidden Markov models; speech synthesis; HMM; accent type; command manipulation; fundamental frequency contour generation; fundamental frequency contour representation; generation process model; speech synthesis; word boundary; Frequency synthesizers; Hidden Markov models; Mathematical model; Pragmatics; Speech; Speech synthesis; HMM-based speech synthesis; flexible control; fundamental frequency contour; generation process model; linguistic information;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Machine Learning for Signal Processing (MLSP), 2011 IEEE International Workshop on

Conference_Location :

Santander

ISSN :

1551-2541

Print_ISBN :

978-1-4577-1621-8

Electronic_ISBN :

1551-2541

Type :

conf

DOI :

10.1109/MLSP.2011.6064596

Filename :

6064596

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2132493