• DocumentCode
    1749766
  • Title

    Trainable speech synthesis with trended hidden Markov models

  • Author

    Dines, John ; Sridharan, Sridha

  • Author_Institution
    Sch. of Electr. & Electron. Syst. Eng., Queensland Univ. of Technol., Brisbane, Qld., Australia
  • Volume
    2
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    833
  • Abstract
    We present a trainable speech synthesis system that uses the trended hidden Markov model to generate the trajectories of spectral features of synthesis units. The synthesis units are trained from a transcribed continuous speech corpus, making the speech more natural than that produced by conventional diphone synthesisers which are generally, trained from a highly articulated speech database and require a large investment of time and effort in order to train a new voice. The,overall system has been incorporated into a PSOLA synthesiser to produce speech that is natural sounding and preserves the identity of the source speaker
  • Keywords
    hidden Markov models; spectral analysis; speech synthesis; HMM; PSOLA synthesiser; natural sounding speech; source speaker identity; spectral features; synthesis units; trainable speech synthesis system; trajectories generation; transcribed continuous speech corpus; trended hidden Markov model; Context modeling; Databases; Hidden Markov models; Industrial training; Laboratories; Maximum likelihood linear regression; Speech synthesis; Synthesizers; Systems engineering and theory; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7041-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2001.941044
  • Filename
    941044