مرکز منطقه ای اطلاع رساني علوم و فناوري - Multi-source based acoustic model for speech synthesis

DocumentCode :

437057

Title :

Multi-source based acoustic model for speech synthesis

Author :

Jianhua, Tao ; Yongguo, Kang

Author_Institution :

Inst. of Autom., Chinese Acad. of Sci., Beijing, China

Volume :

fYear :

2004

fDate :

31 Aug.-4 Sept. 2004

Firstpage :

621

Abstract :

Traditional source-filter model has obvious limitation for speech synthesis in pitch modification due to the lack of spectrum distortion processing. To solve the problem, the paper compares spectrum features of voice source in various F0 ranges and timbres in detail, and generates multi-source (MS) based acoustic model for speech generation in various prosodies and timbres, by classifying and reconstructing voice source into different types. The model enhances the quality of speech synthesis even with strong changing of the speaking mood. It is important for future research on personalized and embedded speech synthesis system.

Keywords :

speech synthesis; embedded speech synthesis system; multisource based acoustic model; personalized speech synthesis system; pitch modification; source-filter model; speaking mood; spectrum distortion processing; speech generation; speech synthesis quality; Acoustic distortion; Automation; Bandwidth; Cepstral analysis; Filters; Frequency; Laboratories; Mood; Pattern recognition; Speech synthesis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal Processing, 2004. Proceedings. ICSP '04. 2004 7th International Conference on

Print_ISBN :

0-7803-8406-7

Type :

conf

DOI :

10.1109/ICOSP.2004.1452740

Filename :

1452740

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=437057