DocumentCode
3361511
Title
Analysis of Vietnamese tones to optimize database in speech synthesis using unit selection method
Author
Vu Duc Lung ; Nguyen Phuoc Loc ; Cao Van Hung ; Nguyen Viet Quoe
Author_Institution
Fac. of Comput. Eng., Univ. of Inf. Technol., Ho Chi Minh City, Vietnam
fYear
2012
fDate
12-15 Dec. 2012
Abstract
This paper presents a novel approach to optimize data in Vietnameses speech synthesis using Unit Selection method. First, we conduct analysis of Vietnamese tone using Fujisaki model to find out the parameters of fundamental frequency contours (F0 contours) influencing on Vietnamese vowels while speech is expressed. Next, analysis, testing, and evaluation of the effects on the vowel are performed. After that, the data of unit selection speech synthesis system is optimized by recording the vowel with a level tone. As a result, when a Vietnamese word is synthesized, it will be synthesized with a level vowel first followed by being adjusted the parameter of F0 contour to create a word with appropriate tone. With this approach, recorded data can be reduced up to 64,44% while sound quality is insignificantly affected.
Keywords
natural language processing; speech synthesis; Fujisaki model; Vietnamese tones analysis; Vietnamese vowels; Vietnamese word; database optimization; fundamental frequency contours; sound quality; speech analysis; speech evaluation; speech synthesis; speech testing; unit selection method; vowel effect; Accuracy; Equations; Speech synthesis; Tin; F0 contours; Fujisaki Model; Vietnamese speech synthesis; data optimization; fundamental frequency; level tone; unit selection;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing and Information Technology (ISSPIT), 2012 IEEE International Symposium on
Conference_Location
Ho Chi Minh City
Print_ISBN
978-1-4673-5604-6
Type
conf
DOI
10.1109/ISSPIT.2012.6621258
Filename
6621258
Link To Document