DocumentCode :
437057
Title :
Multi-source based acoustic model for speech synthesis
Author :
Jianhua, Tao ; Yongguo, Kang
Author_Institution :
Inst. of Autom., Chinese Acad. of Sci., Beijing, China
Volume :
1
fYear :
2004
fDate :
31 Aug.-4 Sept. 2004
Firstpage :
621
Abstract :
The traditional source-filter model has an obvious limitation for pitch modification in speech synthesis because it lacks spectrum distortion processing. To solve this problem, the paper compares in detail the spectral features of the voice source across various F0 ranges and timbres, and builds a multi-source (MS) based acoustic model for generating speech in various prosodies and timbres by classifying the voice source into different types and reconstructing it. The model improves the quality of synthesized speech even under strong changes in speaking mood. It is important for future research on personalized and embedded speech synthesis systems.
Keywords :
speech synthesis; embedded speech synthesis system; multisource based acoustic model; personalized speech synthesis system; pitch modification; source-filter model; speaking mood; spectrum distortion processing; speech generation; speech synthesis quality; Acoustic distortion; Automation; Bandwidth; Cepstral analysis; Filters; Frequency; Laboratories; Mood; Pattern recognition; Speech synthesis;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Signal Processing, 2004. Proceedings. ICSP '04. 2004 7th International Conference on
Print_ISBN :
0-7803-8406-7
Type :
conf
DOI :
10.1109/ICOSP.2004.1452740
Filename :
1452740