DocumentCode
2262457
Title
Phone-based speech synthesis with neural network and articulatory control
Author
Lo, W.K. ; Ching, P.C.
Author_Institution
Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Shatin, Hong Kong
Volume
4
fYear
1996
fDate
3-6 Oct 1996
Firstpage
2227
Abstract
The paper presents a novel method for synthesizing a speech signal using a phone-based concatenation approach. A neural network is employed for the generalization of the phone templates during synthesis. Simplified articulatory space input parameters based on a modified vowel diagram are used to provide flexible and effective articulatory control. It also enables the design of an articulatory control model for allophonic variations in the speech signal. The network approach is chosen for its non-linear mapping of the relationship between the articulatory space parameters and the spectral information of speech signal. In addition, non-linear approximation for phone template transitions is facilitated. The phone templates of the synthesizer are implicitly stored as network parameters of a medium size network. The performance of this new speech synthesis technique is demonstrated with a prototype system specifically designed for Cantonese (a common Chinese dialect) and the synthetic speech quality is assessed by informal listening tests
Keywords
backpropagation; feedforward neural nets; natural languages; neural net architecture; speech intelligibility; speech synthesis; Cantonese; Chinese dialect; allophonic variations; articulatory control; articulatory control model design; generalized phone templates; informal listening tests; medium size network; modified vowel diagram; network parameters; neural network; nonlinear approximation; nonlinear mapping; phone-based concatenation approach; phone-based speech signal synthesis; simplified articulatory space input parameters; spectral information; synthetic speech quality; Controllability; Humans; Network synthesis; Neural networks; Signal mapping; Signal synthesis; Speech coding; Speech processing; Speech synthesis; Synthesizers;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.607248
Filename
607248
Link To Document