DocumentCode :
1416844
Title :
Concatenative synthesis based on a harmonic model
Author :
O´Brien, Darragh ; Monaghan, A.I.C.
Author_Institution :
Sun Microsyst. Inc., Dublin, UK
Volume :
9
Issue :
1
fYear :
2001
fDate :
1/1/2001 12:00:00 AM
Firstpage :
11
Lastpage :
20
Abstract :
One of the most successful approaches to synthesizing speech, concatenative synthesis, combines recorded speech units to build full utterances. However, the prosody of the stored units is often not consistent with that of the target utterance and must be altered. Furthermore, several types of mismatch can occur at unit boundaries and must be smoothed. Thus, both pitch and time-scale modification techniques as well as smoothing algorithms play a crucial role in such concatenation based systems. In this paper, we describe novel approaches to each of these issues. First, we present a conceptually simple technique for pitch and time-scale modification of speech. Our method is based upon a harmonic coding of each speech frame, and operates entirely within the original sinusoidal model. Crucially, it makes no use of “pitch pulse onset times.” Instead, phase coherence, and thus shape invariance, is ensured by exploiting the harmonic relation existing between the sine waves used to code each analysis frame so that their phases at each synthesis frame boundary are consistent with those derived during analysis. Secondly, a smoothing algorithm, aimed specifically at correcting phase mismatches at unit boundaries, is described. Results are presented showing our prosodic modification techniques to be highly suitable for use within a concatenative speech synthesizer
Keywords :
speech coding; speech synthesis; analysis frame; concatenative speech synthesizer; concatenative synthesis; harmonic coding; harmonic model; harmonic relation; mismatch; phase coherence; pitch modification; prosody; recorded speech units; shape invariance; sine waves; smoothing algorithms; speech frame; synthesis frame boundary; time-scale modification; unit boundaries; Coherence; Databases; Harmonic analysis; Pulse shaping methods; Shape measurement; Smoothing methods; Spectral shape; Speech coding; Speech synthesis; Synthesizers;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.890067
Filename :
890067
Link To Document :
بازگشت