DocumentCode :
3109319
Title :
Transformation of F0 contours for lexical tones in concatenative speech synthesis of tonal languages
Author :
Trung-Nghia Phung ; Mai Chi Luong ; Akagi, Masato
Author_Institution :
Japan Adv. Inst. of Sci. & Technol., Ishikawa, Japan
fYear :
2012
fDate :
9-12 Dec. 2012
Firstpage :
129
Lastpage :
134
Abstract :
Concatenative speech synthesis (CSS) provides the greatest naturalness. However, it requires a huge stored database, resulting in a large footprint. Reducing the size of the stored database while preserving the quality of CSS, that is, improving the quality-to-size ratio (QSr), is still a challenge. In this paper, we propose a method for transforming the fundamental frequency (F0) contours of lexical tones. The method is developed from the TD-GMM framework, which was successfully applied to transforming spectral sequences in previous research. It aims to improve the QSr of CSS for tonal languages, so that CSS can be built with limited data at the offline stage and keep a small online footprint while preserving perceptual quality. The experimental results show that the proposed F0 transformation outperforms conventional and state-of-the-art F0 contour transformations for transforming lexical tones in terms of speech quality. When the proposed F0 contour transformation is applied to transforming lexical tones in CSS of tonal languages, the QSr is enhanced compared with the method of simple F0 exchange, while the quality of the synthetic speech is preserved.
Keywords :
natural language processing; speech synthesis; transforms; CSS quality preservation; F0 contour transformation; QSr improvement; TD-GMM framework; concatenative speech synthesis; contour fundamental frequency transformation; database storage capacity reduction; lexical tones; offline stage; online footprint storage; perceptual synthetic speech quality preservation; quality-to-size ratio improvement; tonal languages; Cascading style sheets; Databases; Speech; Speech synthesis; Training; Transforms; Vectors; Concatenative; quality to size ratio; speech synthesis; tone transformation;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Speech Database and Assessments (Oriental COCOSDA), 2012 International Conference on
Conference_Location :
Macau
Print_ISBN :
978-1-4673-2811-1
Electronic_ISBN :
978-1-4673-2812-8
Type :
conf
DOI :
10.1109/ICSDA.2012.6422458
Filename :
6422458