Title :
Unit selection in a concatenative speech synthesis system using a large speech database
Author :
Hunt, Andrew J. ; Black, Alan W.
Author_Institution :
ATR Interpreting Telecommun. Res. Labs., Kyoto, Japan
Abstract :
One approach to the generation of natural-sounding synthesized speech waveforms is to select and concatenate units from a large speech database. Units (in the current work, phonemes) are selected to produce a natural realisation of a target phoneme sequence predicted from text which is annotated with prosodic and phonetic context information. We propose that the units in a synthesis database can be considered as a state transition network in which the state occupancy cost is the distance between a database unit and a target, and the transition cost is an estimate of the quality of concatenation of two consecutive units. This framework has many similarities to HMM-based speech recognition. A pruned Viterbi search is used to select the best units for synthesis from the database. This approach to waveform synthesis permits training from natural speech: two methods for training from speech are presented which provide weights which produce more natural speech than can be obtained by hand-tuning
Keywords :
Viterbi decoding; search problems; speech synthesis; Viterbi decoding; concatenative speech synthesis system; database unit; large speech database; natural speech; natural-sounding synthesized speech; phoneme sequence; phonetic context information; prosodic context information; pruned Viterbi search; state occupancy cost; state transition network; synthesis database; training; transition cost; waveform synthesis; Control system synthesis; Costs; Databases; Laboratories; Natural languages; Network synthesis; Speech recognition; Speech synthesis; State estimation; Viterbi algorithm;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
0-7803-3192-3
DOI :
10.1109/ICASSP.1996.541110