Title :
Using Hybrid HMM-Based Speech Segmentation to Improve Synthetic Speech Quality
Author :
Mporas, Iosif ; Lazaridis, Alexandros ; Ganchev, Todor ; Fakotakis, Nikos
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Patras, Rion, Greece
Abstract :
The automatic phonetic time-alignment of speech databases is essential for the development cycle of a text-to-speech (TTS) system. Furthermore, the quality of the synthesized speech signals is strongly related to the precision of the produced alignment. In the present work we study the performance of a new HMM-based speech segmentation method. The method is based on hybrid embedded and isolated-unit trained models, and has proved to improve the phonetic segmentation accuracy in the multiple speaker task. Here it is employed on the single speaker segmentation task, utilizing a Greek-speech database. The evaluation of the method showed significant improvement in terms of phonetic segmentation accuracy as well as in the perceptual quality of synthetic speech, when compared to the baseline system.
Keywords :
hidden Markov models; natural language processing; speech processing; speech synthesis; Greek-speech database; automatic phonetic time-alignment; hidden Markov model; hybrid HMM-based speech segmentation; isolated-unit trained models; multiple speaker task; single speaker segmentation task; speech synthesis; synthesized speech signals; synthetic speech quality; text-to-speech system; Artificial intelligence; Data engineering; Databases; Hidden Markov models; Informatics; Laboratories; Signal synthesis; Speech analysis; Speech synthesis; Wire;
Conference_Titel :
Informatics, 2009. PCI '09. 13th Panhellenic Conference on
Conference_Location :
Corfu
Print_ISBN :
978-0-7695-3788-7
DOI :
10.1109/PCI.2009.25