مرکز منطقه ای اطلاع رساني علوم و فناوري - Deep mixture density networks for acoustic modeling in statistical parametric speech synthesis

DocumentCode :

178942

Title :

Deep mixture density networks for acoustic modeling in statistical parametric speech synthesis

Author :

Heiga Zen ; Senior, Alan

fYear :

2014

fDate :

4-9 May 2014

Firstpage :

3844

Lastpage :

3848

Abstract :

Statistical parametric speech synthesis (SPSS) using deep neural networks (DNNs) has shown its potential to produce naturally-sounding synthesized speech. However, there are limitations in the current implementation of DNN-based acoustic modeling for speech synthesis, such as the unimodal nature of its objective function and its lack of ability to predict variances. To address these limitations, this paper investigates the use of a mixture density output layer. It can estimate full probability density functions over real-valued output features conditioned on the corresponding input features. Experimental results in objective and subjective evaluations show that the use of the mixture density output layer improves the prediction accuracy of acoustic features and the naturalness of the synthesized speech.

Keywords :

neural nets; speech synthesis; statistical analysis; DNNs; SPSS; acoustic modeling; deep mixture density networks; deep neural networks; full probability density function estimation; mixture density output layer; naturally-sounding synthesized speech; objective evaluations; objective function; real-valued output features; statistical parametric speech synthesis; subjective evaluations; Acoustic distortion; Acoustics; Hidden Markov models; Pragmatics; Speech; Speech synthesis; Training data; Statistical parametric speech synthesis; deep neural networks; hidden Markov models; mixture density networks;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

Conference_Location :

Florence

Type :

conf

DOI :

10.1109/ICASSP.2014.6854321

Filename :

6854321

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=178942