DocumentCode :
1118182
Title :
The Contribution of Various Sources of Spectral Mismatch to Audible Discontinuities in a Diphone Database
Author :
Klabbers, Esther ; Van Santen, Jan P H ; Kain, Alexander
Author_Institution :
OGI Sch. of Sci. & Eng., Oregon Health & Sci. Univ., Beaverton, OR
Volume :
15
Issue :
3
fYear :
2007
fDate :
3/1/2007 12:00:00 AM
Firstpage :
949
Lastpage :
956
Abstract :
One of the major problems in concatenative synthesis is the occurrence of audible discontinuities between two successive concatenative units. Several studies have attempted to discover objective distance measures that predict the audibility of these discontinuities. In this paper, we investigate mid-vowel joins for three vowels with a range of post-vocalic consonant contexts typical for diphone databases. A first perceptual experiment uses a pairwise comparison procedure to find two subsets of unit combinations: Those with versus without audible discontinuities. A second perceptual experiment uses these two subsets in a procedure where formant resynthesis is used to manipulate three sources of discontinuity separately: formant frequencies, formant bandwidths, and overall energy. Results show mismatch in formant frequencies provides the largest contribution to audible discontinuity, followed by mismatch in overall energy
Keywords :
spectral analysis; speech intelligibility; speech synthesis; audible discontinuities; concatenative synthesis; diphone database; diphone databases; formant bandwidths; formant frequencies; formant resynthesis; objective distance measures; pairwise comparison procedure; post-vocalic consonant contexts; spectral mismatch; Bandwidth; Concatenated codes; Cost function; Databases; Frequency; Natural languages; Optimization methods; Speech processing; Speech synthesis; Synthesizers; Audible discontinuities; diphones; spectral distance measures; speech synthesis;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2006.885250
Filename :
4100687
Link To Document :
بازگشت