• DocumentCode
    294648
  • Title

    Automatic speech synthesiser parameter estimation using HMMs

  • Author

    Donovan, R.E. ; Woodland, P.C.

  • Author_Institution
    Dept. of Eng., Cambridge Univ., UK
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    640
  • Abstract
    This paper presents a new approach to speech synthesis which uses a set of decision tree state clustered triphone HMMs to automatically segment a single speaker speech database into sub-word units suitable for use in a synthesiser. Parameters are then obtained for each of these sub-word units from the segmented database, enabling a basic synthesis system to be constructed. This automatic generation of synthesis parameters means that the system can easily be retrained on a new speaker, whose voice it then mimics. It also means that a very large number of sub-word units can be used, which enables more precise context modelling than was previously possible
  • Keywords
    hidden Markov models; parameter estimation; speech synthesis; trees (mathematics); automatic speech synthesiser parameter estimation; context modelling; decision tree state clustered triphone HMM; segmented database; single speaker speech database; sub-word units; Automation; Context modeling; Data engineering; Databases; Decision trees; Hidden Markov models; Parameter estimation; Speech recognition; Speech synthesis; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479679
  • Filename
    479679