Title :
Speech synthesis for text-to-speech alignment and prosodic feature extraction
Author :
Malfrere, F. ; Dutoit, T.
Author_Institution :
TCTS Lab., Fac. Polytech. de Mons, Belgium
Abstract :
The aim of this paper is to present a new and promising approach of the text-to-speech alignment problem. For this purpose, an original idea is developed: a high quality digital speech synthesizer is used to create a reference speech pattern used during the alignment process. The system has been used and tested to extract the prosodic features of read French utterances. The results show a segmentation error rate of about 8%. This system will be a powerful tool for the automatic creation of large prosodically labeled databases and for research on automatic prosody generation
Keywords :
feature extraction; speech processing; speech synthesis; French utterances; automatic prosody generation; high quality digital speech synthesizer; large prosodically labeled databases; prosodic feature extraction; reference speech pattern; segmentation error rate; speech synthesis; text-to-speech alignment; Context modeling; Databases; Error analysis; Feature extraction; Hidden Markov models; Power generation; Speech processing; Speech synthesis; Synthesizers; System testing;
Conference_Titel :
Circuits and Systems, 1997. ISCAS '97., Proceedings of 1997 IEEE International Symposium on
Print_ISBN :
0-7803-3583-X
DOI :
10.1109/ISCAS.1997.612866