DocumentCode
3528924
Title
Optimal learning of P-Layer additive F0 models with cross-validation
Author
Sakai, Shinsuke ; Kawahara, Tatsuya ; Shimizu, Tohru ; Nakamura, Satoshi
Author_Institution
Nat. Inst. of Inf. & Commun. Technol.
fYear
2009
fDate
19-24 April 2009
Firstpage
4245
Lastpage
4248
Abstract
In this paper, we present the derivation of the backfitting training algorithms for generic p-layer additive F0 models for arbitrary positive integer p. We have presented the special cases of the algorithms with p = 2 and p = 3 that have been successfully applied to the modelings of Japanese and English F0 contours, whereas the derivation of the algorithm was presented only for the two-layer case. The additive F0 model have smoothing parameters that establish a trade-off between the fit to the training data and the smoothness of the fitted curves, which have been all set to unity in the previous works. In this paper, we also present an optimal approach to set the values of these parameters using cross validation. We performed the training using the Boston University Radio News Corpus and confirmed the effectiveness of the proposed method.
Keywords
curve fitting; natural language processing; speech synthesis; Boston University Radio News Corpus; backfitting training algorithms; fitted curves smoothness; speech synthesis; training data; Communications technology; Frequency synthesizers; Informatics; Natural languages; Runtime; Smoothing methods; Speech synthesis; Statistical learning; Training data; additive models; fundamental frequency; intonation modeling; speech synthesis; statistical learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960566
Filename
4960566
Link To Document