Title :
Prediction of F0 contours from symbolic and numerical variables using continuous conditional random fields
Author :
Fernandez, Raul ; Minnis, Steve ; Ramabhadran, Bhuvana
Author_Institution :
IBM TJ Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
Regression of continuous-valued variables as a function of both categorical and continuous predictors arises in some areas of speech processing, such as when predicting prosodic targets in a text-to-speech system. In this work we investigate the use of Continuous Conditional Random Fields (CCRF) to conditionally predict F0 targets from a series of s symbolic and numerical predictive features derived from text. We derive the training equations for the model using a Least-Squares-Error criterion within a supervised framework, and evaluate the proposed system using this objective criterion against other baseline models that can handle mixed inputs, such as regression trees and ensemble of regression trees.
Keywords :
feature extraction; least squares approximations; random processes; regression analysis; speech synthesis; CCRF; F0 contour prediction; F0 target prediction; categorical predictors; continuous conditional random fields; continuous predictors; continuous-valued variable regression; least squares error criterion; mixed input handling; numerical predictive features; numerical variables; objective criterion; speech processing; supervised framework; symbolic predictive features; symbolic variables; text-to-speech system; training equations; Feature extraction; Numerical models; Predictive models; Regression tree analysis; Speech; Training; Vegetation; F0 prediction; conditional regression; speech synthesis;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288948