DocumentCode
3009404
Title
A study on the influence of prosody and excitation source model on synthetic speech
Author
Cotescu, Marius ; Gavat, Inge
Author_Institution
Appl. Electron. & Inf. Technol. Dept., Univ. Politeh. of Bucharest, Bucharest, Romania
fYear
2010
fDate
10-12 June 2010
Firstpage
127
Lastpage
130
Abstract
The paper presents a study regarding two methods for improving the naturalness of synthesized speech. We have modeled the excitation source for an LPC vocoder as an impulse train which is passed through a filter to be formed into the excitation signal. The delay between two impulses can be constant, or it can be modulated by the pitch contour extracted from the original utterance. A Glottal Pulse Filter is extracted from the LPC residual so that its frequency response best fits the spectrum of the residual. Four excitation generators were implemented: two unfiltered and two filtered impulse generators. Synthetic speech obtained using the four generators were evaluated and scored by a group of ten people. Festival voices were also evaluated for reference.
Keywords
linear predictive coding; speech synthesis; vocoders; LPC vocoder; excitation source; glottal pulse filter; impulse generators; pitch contour; prosody; synthetic speech; Speech; LPC; Speech synthesis; excitation source model; pitch contour; prosody;
fLanguage
English
Publisher
ieee
Conference_Titel
Communications (COMM), 2010 8th International Conference on
Conference_Location
Bucharest
Print_ISBN
978-1-4244-6360-2
Type
conf
DOI
10.1109/ICCOMM.2010.5509049
Filename
5509049
Link To Document