• DocumentCode
    396797
  • Title

    New objective distance measures for spectral discontinuities in concatenative speech synthesis

  • Author

    Vepa, J. ; King, Simon ; Taylor, Phil

  • Author_Institution
    University of Edinburgh
  • fYear
    2002
  • fDate
    11-13 Sept. 2002
  • Firstpage
    223
  • Lastpage
    226
  • Abstract
    The quality of unit selection based concatenative speech synthesis mainly depends on how well two successive units can be joined together to minimise the audible discontinuities. The objective measure of discontinuity used when selecting units is known as the join cost. The ideal join cost measures perceived discontinuity, based on easily measurable spectral properties of the units being joined, in order to ensure smooth and natural-sounding synthetic speech. In this paper we describe a perceptual experiment conducted to measure the correlation between subjective human perception and various objective spectrally-based measures proposed in the literature. Also we report new objective distance measures derived from various distance metrics based on these spectral features, which have good correlation with human perception to concatenation discontinuities. Our experiments used a state-of-the art unit-selection text-to-speech system: rVoice from Rhetorical Systems Limited.
  • Keywords
    feature extraction; spectral analysis; speech synthesis; Rhetorical Systems Limited; audible discontinuities; concatenative speech synthesis; join cost; objective distance measures; objective spectrally-based measures; perceptual experiment; rVoice; spectral discontinuities; spectral features; subjective human perception; text-to-speech system; unit selection; Art; Cepstral analysis; Costs; Humans; Lattices; Power measurement; Spatial databases; Speech synthesis; System testing; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Speech Synthesis, 2002. Proceedings of 2002 IEEE Workshop on
  • Print_ISBN
    0-7803-7395-2
  • Type

    conf

  • DOI
    10.1109/WSS.2002.1224414
  • Filename
    1224414