• DocumentCode
    3009404
  • Title

    A study on the influence of prosody and excitation source model on synthetic speech

  • Author

    Cotescu, Marius ; Gavat, Inge

  • Author_Institution
    Appl. Electron. & Inf. Technol. Dept., Univ. Politeh. of Bucharest, Bucharest, Romania
  • fYear
    2010
  • fDate
    10-12 June 2010
  • Firstpage
    127
  • Lastpage
    130
  • Abstract
    The paper presents a study regarding two methods for improving the naturalness of synthesized speech. We have modeled the excitation source for an LPC vocoder as an impulse train which is passed through a filter to be formed into the excitation signal. The delay between two impulses can be constant, or it can be modulated by the pitch contour extracted from the original utterance. A Glottal Pulse Filter is extracted from the LPC residual so that its frequency response best fits the spectrum of the residual. Four excitation generators were implemented: two unfiltered and two filtered impulse generators. Synthetic speech obtained using the four generators were evaluated and scored by a group of ten people. Festival voices were also evaluated for reference.
  • Keywords
    linear predictive coding; speech synthesis; vocoders; LPC vocoder; excitation source; glottal pulse filter; impulse generators; pitch contour; prosody; synthetic speech; Speech; LPC; Speech synthesis; excitation source model; pitch contour; prosody;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications (COMM), 2010 8th International Conference on
  • Conference_Location
    Bucharest
  • Print_ISBN
    978-1-4244-6360-2
  • Type

    conf

  • DOI
    10.1109/ICCOMM.2010.5509049
  • Filename
    5509049