Precise tone generation for Vietnamese text-to-speech system

Author

Do, Tu Trong ; Takara, Tomio

Author_Institution

Dept. of Inf. Eng., Univ. of the Ryukyus, Okinawa, Japan

Volume

1

fYear

2003

fDate

6-10 April 2003

Abstract

We propose a Vietnamese text-to-speech (VieTTS) system which is a parametric and rule based speech synthesis system. The fundamental speech units of this system are demisyllables with level tone. VieTTS uses a source-filter model for speech production and a log magnitude approximation (LMA) filter as the vocal tract filter. We chose the Hanoi dialect for VieTTS. Tone synthesis of Vietnamese is implemented by using fundamental frequency (F0) patterns and power pattern control. F0 is the most important factor in Vietnamese tone synthesis and the power control strongly affects broken and drop tones. Applying power control for tone synthesis is effective and unique for Vietnamese compared to other tonal languages such as Chinese and Thai.

Keywords

knowledge based systems; natural languages; power control; speech synthesis; F0 patterns; Hanoi dialect; Vietnamese TTS system; Vietnamese text-to-speech system; broken tones; demisyllables; drop tones; fundamental frequency patterns; level tone; log magnitude approximation filter; parametric system; power pattern control; rule based system; source-filter model; speech synthesis; tone generation; tone synthesis; vocal tract filter; Control system synthesis; Databases; Filters; Frequency synthesizers; Natural languages; Power control; Power system modeling; Speech analysis; Speech processing; Speech synthesis;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on

ISSN

1520-6149

Print_ISBN

0-7803-7663-3

Type

conf

DOI

10.1109/ICASSP.2003.1198828

Filename

1198828