• DocumentCode
    178934
  • Title

    Decision tree usage for incremental parametric speech synthesis

  • Author

    Baumann, Tommy

  • Author_Institution
    Dept. of Inf., Univ. Hamburg, Hamburg, Germany
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    3819
  • Lastpage
    3823
  • Abstract
    Human speakers plan and deliver their utterances incrementally, piece-by-piece, and it is obvious that their choice regarding phonetic details (and the details´ peculiarities) is rarely determined by globally optimal solutions. In contrast, parametric speech synthesizers use a full-utterance context when optimizing vocoding parameters and when determing HMM states. Apart from being cognitively implausible, this impedes incremental use-cases, where the future context is often at least partially unavailable. This paper investigates the `locality´ of features in parametric speech synthesis voices and takes some missing steps towards better HMM state selection and prosody modelling for incremental speech synthesis.
  • Keywords
    decision trees; hidden Markov models; speech synthesis; vocoders; HMM state selection; decision tree usage; full-utterance context; human speakers; incremental parametric speech synthesis; parametric speech synthesizers; vocoding parameters; Cepstral analysis; Context; Decision trees; Hidden Markov models; Pragmatics; Speech; Speech synthesis; HMM Synthesis; Incremental Processing; Interactivity; Speech Synthesis; Spoken Dialogue Systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854316
  • Filename
    6854316