Title :
Precise tone generation for Vietnamese text-to-speech system
Author :
Do, Tu Trong ; Takara, Tomio
Author_Institution :
Dept. of Inf. Eng., Univ. of the Ryukyus, Okinawa, Japan
Abstract :
We propose a Vietnamese text-to-speech (VieTTS) system, a parametric, rule-based speech synthesis system. Its fundamental speech units are demisyllables with level tone. VieTTS uses a source-filter model for speech production, with a log magnitude approximation (LMA) filter as the vocal tract filter. We chose the Hanoi dialect for VieTTS. Vietnamese tones are synthesized by controlling fundamental frequency (F0) patterns and power patterns. F0 is the most important factor in Vietnamese tone synthesis, while power control strongly affects the broken and drop tones. Applying power control to tone synthesis is effective and, compared with other tonal languages such as Chinese and Thai, unique to Vietnamese.
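The combination of F0 pattern control and power pattern control described in the abstract can be illustrated with a minimal sketch of the source side of a source-filter synthesizer. It is not the authors' implementation: the function names (`interpolate`, `synthesize_excitation`), the piecewise-linear contour representation, and all breakpoint values are assumptions chosen for illustration, and the LMA vocal tract filter is omitted entirely.

```python
import math


def interpolate(points, t):
    """Piecewise-linear interpolation of (time, value) breakpoints at time t."""
    if t <= points[0][0]:
        return points[0][1]
    for (t0, v0), (t1, v1) in zip(points, points[1:]):
        if t <= t1:
            return v0 + (v1 - v0) * (t - t0) / (t1 - t0)
    return points[-1][1]


def synthesize_excitation(f0_points, power_points, duration, sample_rate=16000):
    """Return an impulse-train excitation shaped by F0 and power contours.

    f0_points and power_points are lists of (time_in_seconds, value)
    breakpoints. This covers only the excitation source; a real system
    would pass the result through a vocal tract filter.
    """
    n = int(duration * sample_rate)
    samples = []
    phase = 0.0
    for i in range(n):
        t = i / sample_rate
        f0 = interpolate(f0_points, t)       # pitch in Hz at time t
        gain = interpolate(power_points, t)  # amplitude scaling at time t
        phase += f0 / sample_rate            # fraction of a pitch period elapsed
        if phase >= 1.0:
            phase -= 1.0
            samples.append(gain)             # emit one pulse per pitch period
        else:
            samples.append(0.0)
    return samples


# Illustrative placeholder contours (not measured values from the paper):
# a falling pitch whose power is cut off early.
example_f0 = [(0.0, 140.0), (0.3, 100.0)]
example_power = [(0.0, 1.0), (0.15, 1.0), (0.2, 0.0), (0.3, 0.0)]
excitation = synthesize_excitation(example_f0, example_power, duration=0.3)
```

Keeping the two contours as separate inputs mirrors the abstract's point that F0 and power are controlled independently: the same F0 pattern can be paired with a different power pattern without touching the pulse-generation logic.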
Keywords :
knowledge based systems; natural languages; power control; speech synthesis; F0 patterns; Hanoi dialect; Vietnamese TTS system; Vietnamese text-to-speech system; broken tones; demisyllables; drop tones; fundamental frequency patterns; level tone; log magnitude approximation filter; parametric system; power pattern control; rule based system; source-filter model; speech synthesis; tone generation; tone synthesis; vocal tract filter; Control system synthesis; Databases; Filters; Frequency synthesizers; Natural languages; Power control; Power system modeling; Speech analysis; Speech processing; Speech synthesis;
Conference_Title :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198828