Title :
Parametric speech synthesis using local and global sparse Gaussian processes
Author :
Koriyama, Tomoki ; Nose, Takashi ; Kobayashi, Takehiko
Author_Institution :
Interdiscipl. Grad. Sch. of Sci. & Eng., Tokyo Inst. of Technol., Tokyo, Japan
Abstract :
This paper describes an application of Gaussian process regression (GPR) to parametric speech synthesis. GPR enables us to predict synthetic speech parameters by utilizing exemplars of training speech data directly without converting the acoustic features of training data into too small number of model parameters thanks to nonparametric Bayesian regression. However, GPR inherently requires high computational cost and resources. In this paper, to alleviate this problem, we incorporate local and global sparse Gaussian process approximation into the statistical speech synthesis framework, and investigate trade-off between computational cost and speech synthesis performance through experiments. Moreover, we examine the way of choosing pseudo data set used for the sparse GP approximation.
Keywords :
Bayes methods; Gaussian processes; approximation theory; regression analysis; speech processing; speech synthesis; GPR; Gaussian process regression; computational cost; global sparse Gaussian process approximation; local sparse Gaussian process approximation; nonparametric Bayesian regression; parametric speech synthesis prediction; pseudodata set; sparse GP approximation; statistical speech synthesis framework; Approximation methods; Ground penetrating radar; Hidden Markov models; Kernel; Speech; Speech synthesis; Training; Gaussian process regression; parametric speech synthesis; partially independent conditional (PIC) approximation;
Conference_Titel :
Machine Learning for Signal Processing (MLSP), 2014 IEEE International Workshop on
Conference_Location :
Reims
DOI :
10.1109/MLSP.2014.6958921