Title :
Multi-source based acoustic model for speech synthesis
Author :
Jianhua, Tao ; Yongguo, Kang
Author_Institution :
Inst. of Autom., Chinese Acad. of Sci., Beijing, China
fDate :
31 Aug.-4 Sept. 2004
Abstract :
Traditional source-filter model has obvious limitation for speech synthesis in pitch modification due to the lack of spectrum distortion processing. To solve the problem, the paper compares spectrum features of voice source in various F0 ranges and timbres in detail, and generates multi-source (MS) based acoustic model for speech generation in various prosodies and timbres, by classifying and reconstructing voice source into different types. The model enhances the quality of speech synthesis even with strong changing of the speaking mood. It is important for future research on personalized and embedded speech synthesis system.
Keywords :
speech synthesis; embedded speech synthesis system; multisource based acoustic model; personalized speech synthesis system; pitch modification; source-filter model; speaking mood; spectrum distortion processing; speech generation; speech synthesis quality; Acoustic distortion; Automation; Bandwidth; Cepstral analysis; Filters; Frequency; Laboratories; Mood; Pattern recognition; Speech synthesis;
Conference_Titel :
Signal Processing, 2004. Proceedings. ICSP '04. 2004 7th International Conference on
Print_ISBN :
0-7803-8406-7
DOI :
10.1109/ICOSP.2004.1452740