DocumentCode
294648
Title
Automatic speech synthesiser parameter estimation using HMMs
Author
Donovan, R.E. ; Woodland, P.C.
Author_Institution
Dept. of Eng., Cambridge Univ., UK
Volume
1
fYear
1995
fDate
9-12 May 1995
Firstpage
640
Abstract
This paper presents a new approach to speech synthesis which uses a set of decision tree state clustered triphone HMMs to automatically segment a single speaker speech database into sub-word units suitable for use in a synthesiser. Parameters are then obtained for each of these sub-word units from the segmented database, enabling a basic synthesis system to be constructed. This automatic generation of synthesis parameters means that the system can easily be retrained on a new speaker, whose voice it then mimics. It also means that a very large number of sub-word units can be used, which enables more precise context modelling than was previously possible
Keywords
hidden Markov models; parameter estimation; speech synthesis; trees (mathematics); automatic speech synthesiser parameter estimation; context modelling; decision tree state clustered triphone HMM; segmented database; single speaker speech database; sub-word units; Automation; Context modeling; Data engineering; Databases; Decision trees; Hidden Markov models; Parameter estimation; Speech recognition; Speech synthesis; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location
Detroit, MI
ISSN
1520-6149
Print_ISBN
0-7803-2431-5
Type
conf
DOI
10.1109/ICASSP.1995.479679
Filename
479679
Link To Document