DocumentCode :
3186451
Title :
Speech synthesis for a specific speaker based on a labeled speech database
Author :
Hoory, R. ; Chazan, D.
Author_Institution :
Dept. of Electr. Eng., Technion-Israel Inst. of Technol., Haifa, Israel
fYear :
1994
fDate :
9-13 Oct 1994
Firstpage :
146
Abstract :
This paper proposes a new text-to-speech synthesis technique, for producing continuous, natural sounding speech of a specific speaker. The synthesis technique is based on selecting short speech frames from a phoneme-labeled speech database. The selection procedure involves minimization of a distortion criterion, by a dynamic programming algorithm. The proposed scheme is more flexible than many existing schemes using fixed speech segments, such as diphones. It results in a more natural synthesized speech. An efficient speech representation is used to express simply and accurately the spectral continuity of speech. A further improvement in the database search mechanism and in database size was obtained by sectioning the speech phonemes into “steady-states” and “transitions”. The resulting synthesized speech quality, is satisfactory and preserves the natural voice of the speaker
Keywords :
speech synthesis; database search mechanism; distortion criterion; dynamic programming; labeled speech database; minimization; spectral continuity; speech phoneme sectioning; speech representation; text to speech synthesis; Assembly; Databases; Dynamic programming; Heuristic algorithms; Loudspeakers; Minimization methods; Natural languages; Speech analysis; Speech synthesis; Stability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 1994. Vol. 3 - Conference C: Signal Processing, Proceedings of the 12th IAPR International Conference on
Conference_Location :
Jerusalem
Print_ISBN :
0-8186-6275-1
Type :
conf
DOI :
10.1109/ICPR.1994.577142
Filename :
577142
Link To Document :
بازگشت