Title :
Automatic speech synthesiser parameter estimation using HMMs
Author :
Donovan, R.E. ; Woodland, P.C.
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Abstract :
This paper presents a new approach to speech synthesis which uses a set of decision tree state clustered triphone HMMs to automatically segment a single speaker speech database into sub-word units suitable for use in a synthesiser. Parameters are then obtained for each of these sub-word units from the segmented database, enabling a basic synthesis system to be constructed. This automatic generation of synthesis parameters means that the system can easily be retrained on a new speaker, whose voice it then mimics. It also means that a very large number of sub-word units can be used, which enables more precise context modelling than was previously possible
Keywords :
hidden Markov models; parameter estimation; speech synthesis; trees (mathematics); automatic speech synthesiser parameter estimation; context modelling; decision tree state clustered triphone HMM; segmented database; single speaker speech database; sub-word units; Automation; Context modeling; Data engineering; Databases; Decision trees; Hidden Markov models; Parameter estimation; Speech recognition; Speech synthesis; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479679