DocumentCode :
3641893
Title :
Sources of increased variability in HMM synthetic voices
Author :
Marius Cotescu;Inge Gavat
Author_Institution :
Applied Electronics and Information Engineering Department, University “
fYear :
2011
fDate :
5/1/2011 12:00:00 AM
Firstpage :
1
Lastpage :
6
Abstract :
The paper presents a study on the effect of different methods of coding the STRAIGHT aperiodicity coefficients and models of the vocal tract on the quality of synthetic speech generated using HMMs. Three different coding schemes were implemented in the HTS synthesis system: the classic coding of the mean value in five frequency sub-bands, Mel-cepstral coefficients, and a simple unit selection method. The effect of removing the energy and spectral tilt from the speech spectrum, and modeling them independently from the vocal tract was also studied. Five systems were trained using the ARCTIC_SLT database to test the proposed methods. The synthetic voices were evaluated in three subjective listening tests.
Keywords :
"Hidden Markov models","Encoding","Correlation","Cepstral analysis","Speech","Feature extraction","Speech synthesis"
Publisher :
ieee
Conference_Titel :
Speech Technology and Human-Computer Dialogue (SpeD), 2011 6th Conference on
Print_ISBN :
978-1-4577-0440-6
Type :
conf
DOI :
10.1109/SPED.2011.5940735
Filename :
5940735
Link To Document :
بازگشت