• DocumentCode
    1995188
  • Title

    Prosody Modeling from Tone to Intonation in Chinese using a Functional F0 Model

  • Author

    Ni, Jinfu ; Sakai, Shinsuke ; Shimizu, Tohru ; Nakamura, Satoshi

  • fYear
    2008
  • fDate
    15-16 Dec. 2008
  • Firstpage
    397
  • Lastpage
    404
  • Abstract
    Chinese is a tonal language with both lexical tones and intonation, so its fundamental frequency (F0) contours consist of tone and intonation components. This paper presents an approach that models the two components separately and combines them to form the final F0 contours based on a functional F0 model. We analyze tonal patterns as sparse target points (tonal F0 peaks and valleys) and model them using classification and regression trees (CART) with contextual linguistic features. As a first step, we stylize expressive intonation using a few piecewise linear patterns specified by a few markup tags. Both tonal and intonational patterns are represented in a parametric form within the framework of this F0 model. Our experimental results indicated that CART-based modeling of the tonal patterns uttered by two female and male speakers achieved very low F0 prediction errors. In a listening test, native speakers identified 90% of the synthesized stimuli in which emphasis on a word was enhanced. In addition, the linguistic features related to the lexical tone context and to the distinction between voiced and unvoiced initials played the most important role in characterizing the tonal patterns.
  • Keywords
    regression analysis; speaker recognition; trees (mathematics); CART-based modeling; Chinese intonation; classification and regression trees; contextual linguistic features; fundamental frequency contours; lexical tone context; Classification tree analysis; Communications technology; Context modeling; Frequency; Hidden Markov models; Natural languages; Pattern analysis; Piecewise linear techniques; Regression tree analysis; Speech synthesis; CART; Chinese; F0 model; Prosody modeling; intonation; speech synthesis; tone
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Universal Communication, 2008. ISUC '08. Second International Symposium on
  • Conference_Location
    Osaka
  • Print_ISBN
    978-0-7695-3433-6
  • Type

    conf

  • DOI
    10.1109/ISUC.2008.37
  • Filename
    4724492