مرکز منطقه ای اطلاع رساني علوم و فناوري - Linear dynamical models in speech synthesis

DocumentCode :

177504

Title :

Linear dynamical models in speech synthesis

Author :

Tsiaras, Vassilis ; Maia, Ranniery ; Diakoloukas, Vassilis ; Stylianou, Yannis ; Digalakis, Vassilios

Author_Institution :

Sch. of Electron. & Comput. Eng., Tech. Univ. of Crete, Chania, Greece

fYear :

2014

fDate :

4-9 May 2014

Firstpage :

300

Lastpage :

304

Abstract :

Hidden Markov models (HMMs) are becoming the dominant approach for text-to-speech synthesis (TTS). HMMs provide an attractive acoustic modeling scheme which has been exhaustively investigated and developed for many years. Modern HMM-based speech synthesizers have approached the quality of the best state-of-the-art unit selection systems. However, we believe that statistical parametric speech synthesis has not reached its potential, since HMMs are limited by several assumptions which do not apply to the properties of speech. We, therefore, propose in this paper to use Linear Dynamical Models (LDMs) instead of HMMs. LDMs can better model the dynamics of speech and can produce a naturally smoother trajectory of the synthesized speech. We perform a series of experiments using different system configurations to check on the performance of LDMs for speech synthesis. We show that LDM-based synthesizers can outperform HMM-based ones in terms of cepstral distance and are a very promising acoustic modeling alternative for statistical parametric TTS.

Keywords :

hidden Markov models; speech synthesis; HMM; HMM based speech synthesizers; Hidden Markov models; LDM; TTS; acoustic modeling; acoustic modeling scheme; cepstral distance; linear dynamical models; smoother trajectory; speech synthesis; state-of-the-art unit selection systems; statistical parametric TTS; text-to-speech synthesis; Cepstral analysis; Hidden Markov models; Mathematical model; Speech; Speech synthesis; Trajectory; Kalman filter; Linear dynamical model; Statistical parametric speech synthesis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

Conference_Location :

Florence

Type :

conf

DOI :

10.1109/ICASSP.2014.6853606

Filename :

6853606

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=177504