DocumentCode
730769
Title
Prosody generation using frame-based Gaussian process regression and classification for statistical parametric speech synthesis
Author
Koriyama, Tomoki ; Kobayashi, Takao
Author_Institution
Interdiscipl. Grad. Sch. of Sci. & Eng., Tokyo Inst. of Technol., Yokohama, Japan
fYear
2015
fDate
19-24 April 2015
Firstpage
4929
Lastpage
4933
Abstract
This paper proposes novel models of F0 contours and phone durations using Gaussian process regression and classification (GPR and GPC) for statistical parametric speech synthesis. Although the use of frame-based GPR has shown the effectiveness of spectral feature modeling in previous studies, the application of GPR to prosodic features, i.e., F0 and phone duration, was not investigated sufficiently because the kernel function was designed for phonetic information only. In this paper, therefore, we propose a kernel function available for multiple units such as syllables, moras, and accent phrases. The proposed kernel function is based on temporal acoustic events like the beginning of accent phrase and the relative position between the target frame and the event is utilized for the kernel function. Experimental results of objective and subjective tests show that the GPR/GPC-based F0 and duration modeling improves the prediction accuracy of acoustic features compared with HMM-based speech synthesis.
Keywords
Gaussian processes; regression analysis; speech synthesis; Gaussian process classification; accent phrase; accent phrases; frame based Gaussian process regression; kernel function; phone durations; phonetic information; prosody generation; spectral feature modeling; statistical parametric speech synthesis; target frame; temporal acoustic; Context; Context modeling; Ground penetrating radar; Hidden Markov models; Kernel; Speech; Speech synthesis; Gaussian process classification; Gaussian process regression; Statistical parametric speech synthesis; kernel function; prosody;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178908
Filename
7178908
Link To Document