مرکز منطقه ای اطلاع رساني علوم و فناوري - Multi-speaker prosodic instance selection for HMM-based speech synthesis

DocumentCode :

3430524

Title :

Multi-speaker prosodic instance selection for HMM-based speech synthesis

Author :

Yansuo Yu ; Fengyun Zhu ; Xihong Wu

Author_Institution :

Key Lab. of Machine Perception (Minist. of Educ.), Peking Univ., Beijing, China

fYear :

2013

fDate :

6-10 July 2013

Firstpage :

142

Lastpage :

146

Abstract :

In this paper, we propose a novel parametric speech synthesis based on prosodic instance selection to improve the naturalness of synthesized speech especially in the case of small database. Prosodic instances including F₀ and duration are directly selected from the database to preserve rich prosodic variations other than generation from the statistical models. Considering that spectral and prosodic parameters could be modeled separately, prosodic instances from multiple speakers, which are easier to obtain than those of single speaker, are exploited to not only enhance the prosodic models but also enrich the coverage of prosodic context for the synthesized speaker. The results of subjective listening tests on the corresponding databases further show that the proposed method can achieve better performance than both parametric synthesis and waveform concatenation synthesis.

Keywords :

hidden Markov models; speech synthesis; HMM-based speech synthesis; hidden Markov model; multispeaker prosodic instance selection; novel parametric speech synthesis; parametric synthesis; prosodic variations; statistical models; subjective listening tests; waveform concatenation synthesis; Context modeling; Databases; Hidden Markov models; Mathematical model; Speech; Speech synthesis; Training; HMM-Based Speech Synthesis; Prosodic Instance Selection (PIS);

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal and Information Processing (ChinaSIP), 2013 IEEE China Summit & International Conference on

Conference_Location :

Beijing

Type :

conf

DOI :

10.1109/ChinaSIP.2013.6625315

Filename :

6625315

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3430524